SUBSTITUTE FORM PTO-I; 



U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY'S DOCKET NUMBER 
10737-006001 



U.S. APPLICATION NO. (If Known, see 37 CFR 

09/914665



INTERNATIONAL APPLICATION NO. 
PCT/EP00/02064



TITLE OF INVENTION 

RETROVIRAL EXPRESSION VECTORS ON THE BASIS OF HERV-LONG TERMINAL REPEAT SEQUENCES 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. ☒ This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. □ This is an express request to promptly begin national examination procedures (35 U.S.C. 371(f)).

4. □ The US has been elected by the expiration of 19 months from the priority date (PCT Article 31).

5. ☒ A copy of the International Application as filed (35 U.S.C. 371(c)(2))

a. ☒ is attached hereto (required only if not communicated by the International Bureau).

b. □ has been communicated by the International Bureau.

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US).

6. ☒ An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)).

7. ☒ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))

a. □ are attached hereto (required only if not communicated by the International Bureau).

b. □ have been communicated by the International Bureau.

c. □ have not been made; however, the time limit for making such amendments has NOT expired.

d. ☒ have not been made and will not be made.

8. □ An English language translation of amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)).

10. ☒ An English language translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C. 371(c)(5)).

Items 11 to 16 below concern other documents or information included:

11. ☒ An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included.



13. ☒ A FIRST preliminary amendment.

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

15. □ A change of power of attorney and/or address letter.

16. □ Other items or information: 

□ 
□ 
□ 
□ 
□ 



CERTIFICATE 







17. ☒ The following fees are submitted:

Basic National Fee (37 CFR 1.492(a)(1)-(5)):

Neither international preliminary examination fee (37 CFR 1.482)
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO
and International Search Report not prepared by the EPO or JPO $1000

International preliminary examination fee (37 CFR 1.482) not paid to
USPTO but International Search Report prepared by the EPO or JPO $860

International preliminary examination fee (37 CFR 1.482) not paid to USPTO but
international search fee (37 CFR 1.445(a)(2)) paid to USPTO $710

International preliminary examination fee paid to USPTO (37 CFR 1.482)
but all claims did not satisfy provisions of PCT Article 33(1)-(4) $690

International preliminary examination fee paid to USPTO (37 CFR 1.482)
and all claims satisfied provisions of PCT Article 33(1)-(4) $100

ENTER APPROPRIATE BASIC FEE AMOUNT 



Surcharge of $130 for furnishing the oath or declaration later than □ 20 □ 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



Number Filed 



Number Extra 



Independent Claims 



MULTIPLE DEPENDENT CLAIM(S) (if applicable)



+ $270 



TOTAL OF ABOVE CALCULATIONS = 



□ Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above are 
reduced by 1/2. 



SUBTOTAL = 



Processing fee of $130 for furnishing the English Translation later than □ 20 □ 30 
months from the earliest claimed priority date (37 CFR 1.492(f)) 



TOTAL NATIONAL FEE =

Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property +



TOTAL FEES ENCLOSED = 



Amount to be 
refunded: 



Charged: 



a. ☒ A check in the amount of $878.00 to cover the above fees is enclosed.

b. □ Please charge my Deposit Account No. 06-1050 in the amount of $0.00 to cover the above fees. A duplicate copy of this sheet is enclosed.

c. ☒ The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment to Deposit Account No. 06-1050. A duplicate copy of this sheet is enclosed.

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive 
(37 CFR 1.137(a) or (b)) must be filed and granted to restore the application to pending status.

SEND ALL CORRESPONDENCE TO: 



Y. Rocky Tsao 

FISH & RICHARDSON P.C.

225 Franklin Street 

Boston, Massachusetts 02110-2804 

(617) 542-5070 phone 

(617) 542-8906 facsimile 



SIGNATURE



Y. Rocky Tsao 



REGISTRATION NUMBER 



20309513.doc
Attorney's Docket No.: 10737-006001 / P13419-DrB/la

09/914665 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant : Christine Leib-Mosch et al. 
Serial No. : Unassigned 
Filed : Herewith 

Title : RETROVIRAL EXPRESSION VECTORS ON THE BASIS OF HERV-LONG 

TERMINAL REPEAT SEQUENCES 

BOX PCT

Commissioner for Patents 
Washington, D.C. 20231 

PRELIMINARY AMENDMENT 

Applicants submit herewith a Sequence Listing in computer readable form as required by 
37 CFR §1.824. In addition, applicants submit a substitute Sequence Listing as required under 
37 CFR §1.823(a) and a statement under 37 CFR §1.821(f).

Applicants respectfully request entry of the paper copy and computer readable copy of
the Sequence Listing filed herewith for the instant application. Furthermore, applicants request
entry of the following amendments.

In the specification : 

Replace the original Sequence Listing with the substitute Sequence Listing filed 
herewith. 



In the claims : 

Amend claims 3-7, 9-12, 14, 15, and 20 as follows: 

--3. (Amended) Vector according to claim 1 wherein the whole LTR region, the U3
region, or the R and U3 regions are derived from a human endogenous retroviral nucleotide
sequence.--

CERTIFICATE OF MAILING BY EXPRESS MAIL
Express Mail Label No. EL696752561US

I hereby certify under 37 CFR §1.10 that this correspondence is being
deposited with the United States Postal Service as Express Mail Post
Office to Addressee with sufficient postage on the date indicated below
and is addressed to the Commissioner for Patents, Washington,
DC 20231.

Date of Deposit

Signature



Typed or Printed Name of Person Signing Certificate

--4. (Amended) Vector according to claim 1 wherein said nucleotide sequences
encoding one or more proteins or elements of therapeutic and cytokin peptides are selected from
one or more the group consisting of marker genes, therapeutic genes, antiviral genes, anti-tumor
genes, and cytokin genes.--

--5. (Amended) Vector according to claim 1 wherein said cell-specifically controllable
promoter region is derived from the LTR region of a cell-specifically expressed endogenous
human retroviral nucleotide sequence.--

--6. (Amended) Vector according to claim 1 wherein said human endogenous
retroviral cell-specifically controllable promoter sequences are selected from one or more
promoter sequences of HERV families of the group consisting of HERV-K, HERV-H, HERV-E,
HERV-L, HERV-T, HERV-R, HERV-I, HERV-P, ERV9, HERV-W.--

--7. (Amended) Vector according to claim 1 wherein said promoter region besides the
TATA box additionally comprises recognition and binding sites for regulatory proteins.--

--9. (Amended) Vector according to claim 1 wherein said vector is a promoter
conversion vector comprising a 5' LTR portion having the structure U3-R-U5, one or more
sequences selected from coding and non-coding sequences, and a 3' LTR portion comprising a
U3 region which is partially or completely deleted wherein the deleted U3 portion is replaced by
a cell-specifically controllable promoter region from a HERV LTR sequence, followed by the
R-U5 region.--

--10. (Amended) The mRNA or RNA of a retroviral expression vector according to
claim 1.--

--11. (Amended) Prokaryotic cell or eukaryotic cell containing a retroviral expression
vector according to claim 1.--

--12. (Amended) Eukaryotic cell containing a retroviral expression vector according to
claim 1 in an integrated form.--

--14. (Amended) Use of an expression vector according to claim 1 for the expression of
foreign genes in gene therapy.--

--15. (Amended) Virion containing a retroviral expression vector RNA obtained by
transcription of an expression vector DNA according to claim 1.--

--20. (Amended) Retroviral vector system comprising a retroviral expression vector
according to claim 1 and a packaging cell line comprising at least one retroviral or recombinant
retroviral construct encoding for the packaging proteins of the retroviral expression vector.--



REMARKS 



Amendments have been made to the specification to replace the original Sequence 
Listing from the related PCT application with an amended Sequence Listing wherein the general 
information (i.e., inventors, priority data, and attorney docket number) has been amended to 
reflect the information for the instant application. 

Amendments to the claims remove multiple dependency while conserving the claimed 
subject matter. No new matter has been introduced. Claims 1-21 are now pending. Applicants 
submit that all of the claims are now in condition for examination, which action is requested. 

Attached is a marked-up version of the changes being made by the current amendment. 

Please apply any charges or credits to Deposit Account No. 06-1050. 



Fish & Richardson P.C. 

225 Franklin Street 

Boston, Massachusetts 02110-2804

Telephone: (617) 542-5070

Facsimile: (617) 542-8906

20308661.doc



Respectfully submitted,



Date:

Version with markings to show changes made 

In the claims: 

Claims 3-7, 9-12, 14, 15, and 20 have been amended as follows: 

3. (Amended) Vector according to claim 1 [or claim 2] wherein the whole LTR 
region, the U3 region, or the R and U3 regions are derived from a human endogenous retroviral 
nucleotide sequence. 

4. (Amended) Vector according to [one or more of the preceding claims] claim 1 
wherein said nucleotide sequences encoding one or more proteins or elements of therapeutic and 
cytokin peptides are selected from one or more the group consisting of marker genes, therapeutic 
genes, antiviral genes, anti-tumor genes, and cytokin genes. 

5. (Amended) Vector according to [one or more of the preceding claims] claim 1 
wherein said cell-specifically controllable promoter region is derived from the LTR region of a 
cell-specifically expressed endogenous human retroviral nucleotide sequence.

6. (Amended) Vector according to [one or more of the preceding claims] claim 1 
wherein said human endogenous retroviral cell-specifically controllable promoter sequences are 
selected from one or more promoter sequences of HERV families of the group consisting of 
HERV-K, HERV-H, HERV-E, HERV-L, HERV-T, HERV-R, HERV-I, HERV-P, ERV9, 
HERV-W. 

7. (Amended) Vector according to [one or more of the preceding claims] claim 1 
wherein said promoter region besides the TATA box additionally comprises recognition and 
binding sites for regulatory proteins.

9. (Amended) Vector according to [one or more of the preceding claims] claim 1
wherein said vector is a promoter conversion vector comprising a 5' LTR portion having the
structure U3-R-U5, one or more sequences selected from coding and non-coding sequences, and
a 3' LTR portion comprising a U3 region which is partially or completely deleted wherein the
deleted U3 portion is replaced by a cell-specifically controllable promoter region from a HERV
LTR sequence, followed by the R-U5 region.

10. (Amended) The mRNA or RNA of a retroviral expression vector according to
[one or more of the preceding claims] claim 1.

11. (Amended) Prokaryotic cell or eukaryotic cell containing a retroviral expression
vector according to [one or more of the preceding claims] claim 1.

12. (Amended) Eukaryotic cell containing a retroviral expression vector according to 
[one or more of the preceding claims] claim 1 in an integrated form. 

14. (Amended) Use of an expression vector according to [one or more of the 
preceding claims] claim 1 for the expression of foreign genes in gene therapy. 

15. (Amended) Virion containing a retroviral expression vector RNA obtained by
transcription of an expression vector DNA according to [one or more of the preceding claims]
claim 1.

20. (Amended) Retroviral vector system comprising a retroviral expression vector 
according to [one or more of the preceding claims] claim 1 and a packaging cell line comprising 
at least one retroviral or recombinant retroviral construct encoding for the packaging proteins of 
the retroviral expression vector.

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

Applicant : Christine Leib-Mosch et al.
Serial No. : Unassigned
Filed : Herewith
Title : RETROVIRAL EXPRESSION VECTORS ON THE BASIS OF HERV-LONG
TERMINAL REPEAT SEQUENCES



BOX PCT

Commissioner for Patents
Washington, D.C. 20231

VERIFIED STATEMENT UNDER 37 CFR §1.821(f)

I, Jennifer H. Payne, declare that I personally prepared the paper and the computer- 
readable copy of the Sequence Listing filed herewith for the above-identified application and that 
the content of both is the same. 

I further declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and false statements and 
the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 
of The United States Code, and that such willful false statements may jeopardize the validity of 
the application or any patent issued thereon. 



Date:



Fish & Richardson P.C. 

225 Franklin Street 

Boston, Massachusetts 02110-2804

(617) 542-5070 telephone 

(617) 542-8906 facsimile 

20309020.doc 




Jennifer H. Payne



CERTIFICATE OF MAILING BY EXPRESS MAIL

Express Mail Label No.

I hereby certify under 37 CFR §1.10 that this correspondence is being
deposited with the United States Postal Service as Express Mail Post
Office to Addressee with sufficient postage on the date indicated below
and is addressed to the Commissioner for Patents, Washington,
DC 20231.

Typed or Printed Name of Person Signing Certificate

SEQUENCE LISTING

<110> Christine Leib-Mosch 
Ulrike Schon 
Corinna Baust 

<120> RETROVIRAL EXPRESSION VECTORS ON THE BASIS OF
HERV-LONG TERMINAL REPEAT SEQUENCES

<130> 10737-006001 

<150> PCT/EP00/02064
<151> 2000-03-09 

<150> DE 199 10 650.9 
<151> 1999-03-10 

<160> 47 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 375 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 1 

tgtcaggcct ctgagcccaa gctaagccat catatcccct gtgacctgca cgtatacatc 60
cagatagcct gaagcaactg taaaaatatc cttaactgat gacattccaa tattgtgatt 120
tgtttctgcc ctaccctgac tgatcaatgt gctttgtaat ctcccccacc cttcagaagg 180 
ctctttgtaa tcctccccac ccttgagaat ggacttggtg agatccaccc cctgcctgca 240 
aagcattgcc cctaactcca ccgcctgtcc caaaacctat gagaactaat gataatccca 300 
ccacactttg ctgactctct ttccagactc agcccggctg cacccaggtg aaataaacag 360 
ccatgttgct cacat 375 



<210> 2 
<211> 400 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 2 

tgtcaggcct ctgagcccaa gccaggccat cgcatcccct gtgacttgca cgtatacatc 60 
cagatggcct aaagtaactg aagatccaca aaagaagtaa aaacagcctt aactgatgac 120
attccaacat tgtgatttgt tcctgcccca ccctaactga taaatgtact ttgtaatctc 180
ccccaccctt aagaaggtcc tttgtaattc tccccaccct tgagagtgta ctttgtgaga 240
tccacacctg cccaccagag aacaaacccc ctttgactgt aattttccat taccttccct 300 
aatcctataa aacggcccca ccccatctcc ctttgctgac tctcttttcg gactcagccc 360 
gcctgcaccc aggtgaaata aacagccatg ttgctcacat 400 



<210> 3 
<211> 361 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 3

tgtcaggcct ctgagcccaa gctaagccat catatcccct gcgacctgca catatacatc 60
cagatggcct gaagtaactg aagaatcaca aaagaagtga aaatggcctg ttcctgcctt 120
aactgatgac attaccttgt gaaattcctt ctcctggctc atcctggctc aaaagctccc 180
gcactgagca ccttgtgacc cctgcctctg cccgccagag agcaaccccc tcttgactgt 240
aattttcctt tacctaccta aatcctataa aatggcccca ctcctatctc ccttcgctga 300
ctctcttttc ggactcagcc cgcctgtacc caggtgaaat aaacagccat gttgctcaca 360
t 361



<210> 4 
<211> 372 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 4 

tgtcaggcct ctgagcccaa gctaagccat catatcccct gtgacctgca cgtacacatc 60
cagatggccg gttcctgcct taactgatga cattccacca cgaaagaagt gaaaatgacc 120
tgttcgtgcc ttaactgatg acattgtctt gtgaaattcc tcctcctggc tcatcctggc 180
tcaaaagctc cccgactgag tacattgtga cccccactcc tgcccgccag agaacagccc 240
cctttgactg taattttcct ttatctaccc aaatcctata aaacagcccc acctttatct 300
cccttggctg actctctttt cggactcagc ccgcctgcac ccaggtgaaa taaacagcca 360
tgttgctcac at 372



<210> 5 
<211> 349 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 5 

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtgtatgcc 60
cagatggcct gaagtaactg aagaatcaca aaagaagtga aaaggccctg ccccacctta 120
actgagtgat taaccccatg aatttccttc ccctggctca gaagctcccc cactgagcac 180
cttgtgaccc ctgcccctgc ccaccagaga acaaccccct ttgactgtaa ttttccatta 240
ctttcccaaa tcctataaaa cggccccacc cctatctccc ttcgctgact ctcttttcgg 300
actcagcccg cctgcaccca ggtgaaataa acagccatgt tgctcacat 349



<210> 6 
<211> 324 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 6 

tgtcaggcct ctgagcccaa gcctgcacgt atacatccag atgaagcaag tgaagaatca 60
caaaagaagt gaaaatggcc ggttcctgcc ttaactgatg acattacctt gtgaaattcc 120
ttctcctggc tcagaagctc ccccactgag caccttgtga cccccactcc tccccgccac 180
agaacaaccc cctttgactg taattttcca ctgcccgccc aaaccctata aaacggtccc 240
accccatctc ccttccctga ctctcttttc ttcggactca gcccgcctgc acccaggtga 300 
aataaacagc catgttgctc acat 324 



<210> 7 
<211> 393 
<212> DNA 

<213> Human endogenous retrovirus
<400> 7

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtatacgcc 60
cagatggcct gaagtaactg aagaatcaca aaagaactga aaaggccctg ccccgcctta 120
actgatgaca ttccaccatg gtgatttgtt cttgccccac cttaactgag tgattaaccc 180
tgtgaatttg cttctcctgg ctcagaagct cccccactga gcaccttgtg acccccgccc 240
ctgcccacca gagaacaacc ccctttgact gtaattttcc attaccttcc caaatcctat 300
aaaacggccc cacccctatc tcccttcgct gactctcttt tcggactcag cccgcctgcc 360
cccaggtgaa ataaacagcc atgttgctca cat 393



<210> 8
<211> 393
<212> DNA

<213> Human endogenous retrovirus
<400> 8

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtatacgcc 60
cagatggcct gaagtaactg aagaatcaca aaagaactga aaaggccctg ccccgcctta 120
actgatgaca ttccaccatg gtgatttgtt cttgccccac cttaactgag tgattaaccc 180
tgtgaatttg cttctcctgg ctcagaagct cccccactga gcaccttgtg acccccgccc 240
ctgcccacca gagaacagac ccctttgact gtaattttcc attaccttcc caaatcctat 300
aaaacggccc cacccctatc tcccttcgct gactctcttt tcggactcag cccgcctgcc 360
cccaggtgaa ataaacagcc atgttgctca cat 393



<210> 9 
<211> 388 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 9 

tgtcaggcct ctgagcccaa gctaagccat catatcccca gggacctgca cgtatacatc 60
cagatggcct gaagcaactg aagatccaca aaggaagtga aaatagcctt aactgatgac 120
attccaccat tgtgatttgt ttctgcccca tcctaactga tcaatgtact ttgtaatctc 180
tcccaccctt aagaaggttc tttgtaattc tccccaccct tgagagtgta ctttgtgaga 240
tccaccccct gccggcaaaa cattgctcct aacccaaccg cctaccccaa acctgtaaga 300
actaatgata atccaccacc ctttgctgac tcttttcaga atcagcccgc ctgcacccag 360
gtgaaataaa cagccatgtt gctcacat 388



<210> 10
<211> 314
<212> DNA

<213> Human endogenous retrovirus
<400> 10

tgtcaggcct ctgagcccaa gctaagccat caaatcccct gtgacctgca cgtgtacatc 60
cagatgacct gaagcaactg aagatccaca aaagaagtga aagtagcctt aactgatgac 120
attccaccat tgtgatttgt tcctgcccca cgctaactga taccatatat tcttcccccg 180 
cccttgagaa tgtactttgt acacctatcc caaacctata agaactaatg ataatcctac 240 
caccctttgc tgactctctt tttggactca gcccgcctgc acccaggtga aataaacagc 300 
catgttgctc acat 314 



<210> 11 
<211> 309 
<212> DNA 



<213> Human endogenous retrovirus
<400> 11

tgtcaggcct ctgagcccaa gctaagccat catatcccgt gacctgcata tacatccaga 60
tggcctgaag caactgaaga tccacaaagg aagtgaaaat agccttaact gatgacattc 120
caccattgtg atttgttcct gccccacgct aactgatacc atatattctt cccccgccct 180
tgagaatgta ctttgtacac ctatcccaaa cctataagaa ctaatgaCaa tccaccaccc 240
tttgctgact ctctttttgg actcagcccg cctgcaccca ggtgaaataa acagccatgt 300
tgctcacat 309

<210> 12 
<211> 314 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 12 

tgtcaggcct ctgagcccaa gctaagccat caaatcccct gtgacctaca cgtgtacatc 60
cagatgacct gaagcaactg aagatccaca aaagaagtga aagtagcctt aactgatgac 120 
attccaccat tgtgatttgt tcctgcccta cgctagctga taccatatat tcttcccccg 180 
cccttgagaa tgtactttgt acacctatcc caaacctata agaactaatg ataatcctac 240 
caccctttgc tgactctctt tttggactca gcccgcctgc acccaggtga aataaacagc 300 
catgttgctc acat 314 



<210> 13 
<211> 341 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 13 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccga ctaagaatcc 60
ctaagcctag ctgggaaggt gaccgcatcc acctttaaac acggggctcg caacttagct 120 
cacacccaac caatcaggta gtaaagaggg ctcactaaaa tgctaattag gcaaagacag 180 
gaggtaaaga aatagccaat catctattgc ctgagagcac agcaggaggg acaatgatcg 240 
ggatataaac ccaagtcttc gagccggcaa tggctacctt ctttgggtcc cctccctttg 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg c 341 



<210> 14
<211> 341
<212> DNA

<213> Human endogenous retrovirus
<400> 14

tgttgagatg ggggactgag agacaggact agctggattt cctaggccga ctaagaattc 60
ctaagcctag ctgggaaggt gaccgcatcc atctttaaac atggggcttg caacttaact 120
catatctgac caatcaggta gtaaagagag ctcactaaaa tgctaattag gctaaaacag 180
gaggcaaaga agtagccaat catctgttgc ctgacagcac agcaggaggg acaatgatcg 240
ggatataaac ccaggcattc gagccagcta cagctaccct ctttgggtcc cctccctttg 300
tatgggagct ctgtcttcac tctattaaat cttgcaactg c 341



<210> 15 
<211> 322 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 15

tgttgagatg ggggactgag agacaggact acctggattt cctaggccga ctaagaatcc 60
ctaagcctag ctgggaaggt gaccacatcc acctttaaac acagggcttg caacttagct 120
cacacttgac cagtcaggta gtaaagagag ctcactaaaa tgctaattag gctaaaacag 180
gaggtaaaga aatagacaat catctatcac ctgagagcac agtgggaggg acaatgatcg 240
gcatataaac ccaggcattc gagccagcaa cagcaacccc ctttgggagc tctgttttca 300
ctctattaaa tcttgcaact gc 322



<210> 16 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 16 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60
ctaagcctag ctgggaaggt gactacaccc acctttaaac atggggcttg caacttagct 120
cacacccaac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343



<210> 17 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 17 

tgttgagatg ggggactgag agacaggact agctggattc cctaggccga ctaagaatcc 60
ctaagcctag ctgggaaggt gaccacatcc acctttaaac acggggcttg caacttagct 120
catacccaac aaatcaggta gtaaagagag ctcactaaaa tactgattag gcgaaaacag 180
gaggtaagga aacagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 18 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 18 

tgttgagatg ggggactgag agacaggact agttggattt cctaggctgg ctaagaatcc 60
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300
tatgggagcc ctgttttcac tctattaaat cttgcaactg caa 343



<210> 19 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus
<400> 19

tgttgagatg ggggactgag agacaggact agttggattt cctaggccgg ctaagaatcc 60
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300
tatggaagct ctgttttcac tctattaaat cttgcaactg caa 343



<210> 20 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 20 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60
ctaagcctag ctgggaaggt gactacaccc acctttaacc actaggcttg caacttagct 120
cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtagaga aatagccaat catctatcgc ctgagagcac agcaggaggg acaatgatcc 240
ggatataaac ccaagcattc gagccagcaa tggctaccct ctttgtgtcc cctccctttg 300
tatgggagct ctattttcac tctattaaat cttgcaactg caa 343

<210> 21 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 21 

tgttgagatg ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc 60
ctaagcctag ctgggaaggt gaccgcatcc atctttaaac atggggcttg caacttaact 120
catatctgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343

<210> 22 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 22 

tgttgagatg ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc 60
ctaagcctag ctgggaaggt gactacaccc acctttaacc actaggcttg caacttagct 120
cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343



<210> 23 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 23 



tgttgagatg ggggactgag agacaggact agttggattt cctaggctgg ctaagaatcc 60
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300
tatggaagct ctgttttcac tctattaaat cttgcaactg caa 343



<210> 24 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 24 

tgttgagatg ggggactgag agacaggact acctggattt cctaggccaa ctaagaatct 60 
ctaagcctag ctgggaaggt gaccacatcc acctttaaac acagggcttg caacttagct 120
cacacccgac ccatcaggta agaaagagag cccgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagtcaat catctattgc ctgagagcac agcgggaggg acaatgatca 240 
ggatataaac ccaggcattc gagccggcaa cgactaccct ctttgggtcc cctccctttg 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 25 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 25 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60
ctaagcctag ctgggaaggt gactacaccc acctttaaac actaggcttg caacttagct 120
cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 26 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 26 

tgttgagatg ggggactgag aaacaggact agcaggattt cctaggccga ttaagaatcc 60
ctaagcctag atgggaagtt gaccacatcc acctttaaac acggggcttg caactcagct 120
cacacccgac ccatcaggta agaaagagag cccgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccaat catctattgc ctgagagcac agcgggaggg acaatgatca 240
ggatataaac ccaggcattc gagccggcaa cgactaccct ctttgggtcc cctccctttg 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 27 
<211> 619 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 27 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180
ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc catatttcag 240
actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420
tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480
gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540
atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600
tttccccctt ttcttttcg 619



<210> 28 
<211> 620 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 28 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180
ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc catatttcag 240
actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420
tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480
gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540
atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600
tttccccctt ttcttttcga 620



<210> 29 
<211> 624 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 29 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180
ttacgggtgt cgggctgggg gatggtcagg tctttccctt cccacgaggc catatttcag 240
actatcacat gggaagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcactgcc cttaatccat 420
tcaaccctga gttgacacag cgcacgtttc agagagcacg gggttggggg taaggtcata 480
gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540
atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600
tttccccctt ttcttttcga caaa 624



<210> 30
<211> 646
<212> DNA

<213> Human endogenous retrovirus
<400> 30

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180
ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc cataCttcag 240
actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420
tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480
gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540
atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600
tttccccctt ttcttttcga caaaaccgcc atctcgagat ctgagt 646

<210> 31 
<211> 672 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 31 

gtcccacctc cagccctaag gcggtttttc cctatctcag tagatggagc atacaatcgg 60

gttttatacc gagacattcc attgcccagg gacaggcagg agacagatgc cttcctcttg 120

tctcaactgc aagaggcatt ccttcctctt atactaatcc tcctcagcac agacccttta 180

cgggtgtcgg gctgggggac ggtcaggtct ttcccttccc acgaggccat atttcagact 240 

atcacatggg gagaaacctt ggacaatacc tggctttcct aggcagaggt ccctgcggcc 300

ttccgcagtt tttgtgtcct gggtacttga gattagggag tggtgatgac tcttaaggag 360 

catgctgcct tcaagcatct gtttaacaag gcacatcctg caccgccctt aatccattca 420 

accctgagtt gacacagcac acgtttcaga gagcacgggg ttgggggtaa ggtcatagat 480 

taacagaatc tcaaggcaga agaatttttc ttaacacata acaaaatgga gtctcccatg 540

tctacttctt tctacacaga cacagtaaca atctgatccc tcttgctttt ccccacattt 600 

cccccttttc ttatccatca cactggcggc cgctcgagca tgcatctaga gggcccaatt 660 
cgccctatag tg 672



<210> 32 
<211> 593 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 32 

agtagatgga gcatacaatc gggttttata ccgagacatt ccattgccca gggacaggca 60

ggagacagat gccttcctct tgtctcaact gcaagaggca ttccttcctc ttttactaat 120

cctcctcagc acagaccctt tacaggtgtc gggctggggg acggtcaggt ctttcccttc 180 

ccacgaggcc atatttcaga ctatcacatg gggagaaacc ttggacaata cctggctttc 240 

ctaggcagag gtccctgcgg ccttctgcag tttttgtgtc cctgggtact tgagattagg 300

gagtggtgat gactcttaag gagcatgctg ccttcaagca tctgtttaac aaagcacatc 360 

ctgcaccgcc cttaatccat tcaaccctga gttgacacag cacatgtttc agagagcacg 420 

gggttggggg taaggtcata gattaacaga atctcaaggc agaagaattt ttcttagcac 480 

ataacaaaat ggagtctcct atgtctactt ctttctacac agacacagta acaatttgat 540 

ctctcttgct tttccccaca tttccccctt ttcttttcga caaaaccgcc atc 593

<210> 33 
<211> 943 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 33 

tgtgggcgaa ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60
ataatataga aaatagctag aataagaata gttataataa aaattagata tacacatgat 120
catggacatt accaatcatt actacaaaca ttgttaatca ttagctttta atattactct 180 
ttgttttatt actaatataa ccaaggaata accggtagca tacggtcagg tgctgaaggg 240 
acattgtgag aagtgaccta gaaggcaaga ggtgagcctt ctgtcacgcc tgcataagga 300
cagcttgagg gctccttggt caagctgtaa caccagtgcc tgggaaggca ccgttactta 360 
gcagaccatg aaagggagtc tccattcctt ggaggagtca gggaaacact atgctccacc 420 
agcttcttgt gtatccagcc ctgcccacag tcatccagag gcataaaccc ctccctgtgg 480 
tgctgtgctt caatggccat gcttcttgtc cactttcatg ttcctcctgt actcctggtt 540 
cctctttgaa gttcgtagaa gataatggta gaagaaatag tgaaagtctt tgatctttct 600 
tataagtgca tagaagaaaa cactgatgta tgcctgcctt ccctctctgc ttcagctacc 660 
taaaaggaaa ggcccccttt cccatgatca catgacttgc ctgaccttat caatcacttg 720 
gaggactcac cctccttacc ctgtcccttt gtcttgtatg caataaatat cagcacgccc 780 
agccattcgg ggccactact ggtctccgca acttggtggt agtggtaccc tgggcccagc 840
tgttttctct ttatctcttt tgtcttgtgt ctttatttct tacaatctct catctctgca 900 
catggggaga acaccggcaa agcccgtagg gctggacctt aca 943



<210> 34 
<211> 389 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 34 

aaacccctcc ctgtggtgct gtgcttcaat ggccatgctt cttgtccact ttcatgttcc 60 
tcctgtactc ctggttcctc tttgaagttc gtagaagata atggtagaag aaatagtgaa 120
agtctttgat ctttcttata agtgcataga agaaaacact gatgtatgcc tgccttccct 180 
ctctgcttca gctacctaaa aggaaaggcc ccctttccca tgatcacatg acttgcctga 240 
ccttatcaat cacttggagg actcaccctc cttaccctgt ccctttgtct tgtatgcaat 300 
aaatatcagc acgcccagcc attcggggcc actactggtc tccgcaactt ggtggtagtg 360
gtaccctggg cccagctgtt ttctcttta 389 



<210> 35 
<211> 858 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 35 

tgtgggcgga agagtaccta ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 

ataataaaga aaatagaata agaatagtca taatacaaat tagatacagc gatgatcatg 120

aacaattatc catcattatt ataaacatta ttaatcatta gcttttaata ttactctgtt 180 

gcattaataa tataacctag gaataaccgg caggtatagg gtcaggtgct gaagggacat 240

tgtgagaagt gaatagaagg caagagggga gccttctgtc atgcccgcat aagggccgct 300 

tgagggcccc ttggtcaagc ggtaacgcca gtgtctggga aggcacccgt tactgagcag 360 

accgggaaag ggagtctcct ttccttggag gagtcaggga acgctctgct ccaccagctt 420

cttgtgggag gctggatgtt acccaggcct gcctgcagtc atccggaggc ctgaacccct 480

ccctgtggtg cttcaatggt cacgttcctt gtccactttc atgctccttc cgtactcctg 540 

gttcctcttt gaagttcgta gtagatagcg gtagaagaaa tagtgaaagt cttaaagtct 600 

ttgatcttat aagttcatag aagaaaacgc tgatgcctgc cgccttctct ctctgcttca 660

gctacctaag agggaagggc ccgctgtcct gtgatcaggt gacttgcttc accttgtcaa 720

tcacttagaa gactgaccct ccttatcctg cccccttgtc ttgtatgcaa taaatatcag 780 

cgagcccagc cgttcagggc cactaccggt ctccgtgtct ttgtggtagt ggtccccggg 840 
cccagctgtt ttctcttt 858 



<210> 36 
<211> 386 
<212> DNA

<213> Human endogenous retrovirus 
<400> 36 

gaacccctcc ctgtggtgct tcaatggtca cgttccttgt ccactttcat gctccttccg 60 

tactcctggt tcctctttga agttcgtagt agatagcggt agaagaaata gtgaaagtct 120

taaagtcttt gatcttataa gttcatagaa gaaaacgctg atgcctgccg ccttctctct 180 

ctgcttcagc tacctaagag ggaagggccc gctgtcctgt gatcaggtga cttgcttcac 240

cttgtcaatc acttagaaga ctgaccctcc ttatcctgcc cccttgtctt gtatgcaata 300 

aatatcagcg agcccagccg ttcagggcca ctaccggtct ccgtgtcttt gtggtagtgg 360
tccccgggcc cagctgtttt ctcttt 386 



<210> 37 
<211> 844 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 37 

tgtgggtgga ggattaccca ggtgccaagg caagagactg aaggcacaaa ctgtttcagt 60

ataataaaaa aaatagaata agaatagtca taatacaaat tagatataga gatgatcatg 120

gacaattagc aatcactatt aatctttagc ttttaatatt actctttgtt gcattactaa 180 

tataacctag gaataaccgg tgggtatagg gtcaggtgct gaagggacat tgtgtgaagt 240

gacctggaag gcaagaggtg agccctctgt cacgcccaca taagggccgc ttgagggctc 300

cttggtcaag tggtaacgcc agtgtctggg aatgcacccg ttaattagca gaccgcgaaa 360

gggagtctcc tttccttgga agagttgggg aacactctgc tccaccagct tcttgtggaa 420 

ggctggatat tatccaggcc tgcgcgcagt catccggagg cttaaaccct tccctgtggt 480

gctgtgcttc aatggtccca ctccttgtcc actttcatgc tcctcccgta ctcctggttc 540 

ctctttgaag agcgcagtag atagcggtag aagaaatagt gaaagtctta aagtcttcga 600

tctttcttac aagtgcagag aagaaaacgc tgacatatgc tgccttccct ctctgcttcg 660 

gctacctaaa agggaagggc cgcctatcct gtaatcacat gacttgcttc accttgtcaa 720

tcacttagaa gattcactct ccttaccctg cccccttgtc ttgtatgcaa taaatatcag 780

tgaccccagc cgttcagggc cactactggt ctccgcgtct tgatggtagt ggtcaccccg 840
gccc 844 



<210> 38 
<211> 381 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 38 

aaacccttcc ctgtggtgct gtgcttcaat ggtcccactc cttgtccact ttcatgctcc 60
tcccgtactc ctggttcctc tttgaagagc gcagtagata gcggtagaag aaatagtgaa 120
agtcttaaag tcttcgatct ttcttacaag tgcagagaag aaaacgctga catatgctgc 180 
cttccctctc tgcttcggct acctaaaagg gaagggccgc ctatcctgta atcacatgac 240 
ttgcttcacc ttgtcaatca cttagaagat tcactctcct taccctgccc ccttgtcttg 300
tatgcaataa atatcagtga ccccagccgt tcagggccac tactggtctc cgcgtcttga 360
tggtagtggt caccccggcc c 381



<210> 39 
<211> 859 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 39 
tgtgggtgga ggattaccca ggtgccaagg caagagactg aaggcacaaa ctgtttcagt 60

ataataaaaa aaatagaata agaatagtca taatacaaat tagatataga gatgatcatg 120

gacaattagc aatcactatt aatctttagc ttttaatatt actctttgtt gcattactaa 180 

tataacctag gaataaccgg tgggtatagg gtcaggtgct gaagggacat tgtgagaagt 240 

gacctggaag gcaagaggtg agccctctgt cacgcccaca taagggccgc ttgagggctc 300 

cttggtcaag tggtaacgcc agtgtctggg aatgcacccg ttaattagca gaccgcgaaa 360 

gggagtctcc tttccttgga agagttgggg aacactctgc tccaccagct tcttgtggaa 420

ggctggatat tatccaggcc tgcgcgcagt catccggagg cttaaaccct tccctgtggt 480 

gctgtgcttc aatggtccca ctccttgtcc actttcatgc tcctcccgta ctcctggttc 540 

ctctttgaag agcgcagtag atagcggtag aagaaatagt gaaagtctta aagtcttcga 600 

tctttcttac aagtgcagag aagaaaacgc tgacatatgc tgccttccct ctctgcttcg 660

gctacctaaa agggaagggc cgcctatcct gtaatcacat gacttgcttc accttgtcaa 720 

tcacttagaa gattcaccct ccttaccctg cccccttgtc ttgtatgcaa taaatatcag 780 

tgaccccagc cgttcagggc cactactggt ctccgcgtct tgatggtagt ggtcaccccg 840 
gcccaggtgt tttttcttt 859 



<210> 40 
<211> 396 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 40 

aaacccttcc ctgtggtgct gtgcttcaat ggtcccactc cttgtccact ttcatgctcc 60 
tcccgtactc ctggttcctc tttgaagagc gcagtagata gcggtagaag aaatagtgaa 120
agtcttaaag tcttcgatct ttcttacaag tgcagagaag aaaacgctga catatgctgc 180
cttccctctc tgcttcggct acctaaaagg gaagggccgc ctatcctgta atcacatgac 240
ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtga ccccagccgt tcagggccac tactggtctc cgcgtcttga 360 
tggtagtggt caccccggcc caggtgtttt ttcttt 396



<210> 41 
<211> 966 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 41 

tgtgggtgga ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60

ataataaaga aaatggttag aataagaata gtcataatac aaattagata tagagatgat 120

catggacaat tatcaatcat tattataaac attattaatc attagctttt aatattactc 180 

tttgttgcat tactaatata acctaggaat aaccggtggg tatagggtca ggtgctgaaa 240 

ggacattggg agaagtgacc tagaaggcaa gaggtgagtc ttctgtcacg cccgcataag 300

ggttgcttga gggctccttg gtcaagtggt aacgccggtg tctgggaagg cacctgttac 360 

ttagccgacc acgaaaggga gtctcctttc cttggaggag tcagggcgca ctctgctcca 420

ccagcttctt gtggaaggct ggatattatc caggcctgcc cgcagtcatc cggaggccta 480 

aacccctccc tgtggtgctg tgcttcaatg ggcacactcc tcgtccactt tcatgttcct 540 

cccatactcc tggtttctct ttgaagttcg tagtagatag tggtagaagg aatagggaaa 600

atcttaaagt gtttgatctt tcttataagt gcatagaaga aaacgctgac atatgctgcc 660 

ttctctgtct gcttcagcta cctaagaggg aagggccccc tgtccagtga tcacgtgact 720 

tgcttcacct tgtcaatcac ttagaagatt caccctcctt accctgcccc cttgtcttgt 780 

atgcaataaa tatcagtgca cccagccttt cggggccact taccggtctc cacgtcttgg 840 

tggtagtggt cccccgggcc cagctgtttt ctctttatct ctttgtcttg tgtcttattt 900 

attacaatct ctcgtctccg cacacaggga gaacacccgc taagctccgt agggctggac 960

cctaca 966 



<210> 42 
<211> 398 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 42 

aaacccctcc ctgtggtgct gtgcttcaat gggcacactc ctcgtccact ttcatgttcc 60 
tcccatachc ctggtttctc tttgaagttc gtagtagata gtggtagaag gaatagggaa 120 
aatcttaaag tgtttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 
cttctctgtc tgcttcagct acctaagagg gaagggcccc ctgtccagtg atcacgtgac 240 
ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtgc acccagcctt tcggggccac ttaccggtct ccacgtcttg 360 
gtggtagtgg tcccccgggc ccagctgttt tctcttta 398 



<210> 43 
<211> 938 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 43 

tgtgggtgga ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60

ataataaaga aaatggttag aataagaata gtcataatac aaattagata tagagatgat 120

catggacaat tatcaatcat tattataaac attattaatc attagctttt aatattactc 180 

tttgttgcat tactaatata acctaggaat aaccggtggg tatagggtca ggtgctgaag 240

ggacattggg agaagtgacc tagaaggcaa gaggtgagtc ttctgtcacg cccgcataag 300 

ggttgcttga gggctccttg gtcaagtggt aacgccggtg tctgggaagg cacctgttac 360 

ttagccgacc acgaaaggga gtctcctttc cttggaggag tcagggcaca ctctgctcca 420 

ccagcttctt gtggaaggct ggatattatc caggcctgcc cgcagtcatc cggaggccta 480 

aacccctccc tgtggtgctg tgcttcaatg ggcacactcc tcgtccactt tcatgttcct 540 

cccatactcc tggttcctct ttgaagttcg tagtagatag tggtagaagg aatagggaaa 600 

atcttaaagt gtttgatctt tcttataagt gcatagaaga aaacgctgac atatgctgcc 660 

ttctctgtct gcttcagcta cctaagaggg aagggccccc tgtccagtga tcacgtgact 720

tgcttcacct tgtcaatcac ttagaagatt caccctcctt accctgcccc cttgtcttgt 780 

atgcaataaa tatcagtgca cccagccttt cggkkcactt accggtctcc acgtcttggt 840 

ggtagtggtc ccccggccca gctgttttct ctttatctct ttgtcttgtg tcttatttat 900 

tacaatctct cgtctccgca cacagggaga acacccgc 938 



<210> 44 
<211> 396 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 44 

aaacccctcc ctgtggtgct gtgcttcaat gggcacactc ctcgtccact ttcatgttcc 60
tcccatactc ctggttcctc tttgaagttc gtagtagata gtggtagaag gaatagggaa 120
aatcttaaag tgtttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 
cttctctgtc tgcttcagct acctaagagg gaagggcccc ctgtccagtg atcacgtgac 240
ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtgc acccagcctt tcggkkcact taccggtctc cacgtcttgg 360 
tggtagtggt cccccggccc agctgttttc tcttta 396



<210> 45 
<211> 963 
<212> DNA 



<213> Human endogenous retrovirus 
<400> 45 

tgtgggcgaa agattaccta ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 

ataataaaga aaatagttaa aataagaata gttataatac aaattagata tagagatgat 120 

catggacaat tatcaatcat tattataaac attaatcatt agcttttaat attactcttt 180 

gttgctttac taatataacc taggaataac cggtgggtat agggtcaggt gttgacggga 240 

tattgtgaga agtgacctag aaggcaagag gtgagccttc tgtcacgccc acataagggc 300

cgcttgaggg ctctttggtc aagtggtaac gccagtgtct gtgaaggcac ctgttactta 360 

gcagaccgcg aaagggagtc tcctttcctt ggaggagtca gggaacactc tgctccacca 420

gcttcttgtg gaaggctgga tattatctag gcctgcccgc agtcatctgg aggcctaaac 480 

ccctccctgt ggtgctgtgc ttcagtggtc actctccttg tccactttca tgttcctccc 540 

gtactcctgg ttcctctttg aagttcgtag tagatagcag tagaagaaat agtgaaagtc 600

ttaaagtatt tgatctttct tataagtgca tagaagaaaa cgctgacata tgctgccttc 660 

tctatctctg cggtggctac ctaaaaggga agggccccct gtcccatgat catgtgactt 720 

gcttcacctt atcacttaga agattcatcc tccttaccct gcgccccctc gtcttgtatg 780 

caataaatat cagcacgccc agtcgtttga ggccactgcc ggtctccgcg tcttggtggt 840 

agtggtcccc cgggcccagc tattgtctct ttatctcttt gtcttgtgtc tttatttatt 900 

acaatctctt gtctctgcac acagggagaa cacctgctaa gccccgtagg actggaccct 960
aca 963 



<210> 46 
<211> 397 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 46 

aaacccctcc ctgtggtgct gtgcttcagt ggtcactctc cttgtccact ttcatgttcc 60 
tcccgtactc ctggttcctc tttgaagttc gtagtagata gcagtagaag aaatagtgaa 120 
agtcttaaag tatttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 
cttctctatc tctgcggtgg ctacctaaaa gggaagggcc ccctgtccca tgatcatgtg 240 
acttgcttca ccttatcact tagaagattc atcctcctta ccctgcgccc cctcgtcttg 300 
tatgcaataa atatcagcac gcccagtcgt ttgaggccac tgccggtctc cgcgtcttgg 360 
tggtagtggt cccccgggcc cagctattgt ctcttta 397 



<210> 47 
<211> 489 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 47 

tgttcaattc tttgccttct acttttaaac ttaacttcct cataaagcaa cctttttcaa 60 
tcacctgctc cactctgact cattctgatc acctgctcca ccctgactca ttccgatcac 120 
ctgatccact gtgactcatt ccgattaccc gctccaccct gactcattct gattctgatt 180 
tcctgctctg ccataaccat ttttcccgcc aaaccactca ccctgtcact ctctttaaat 240 
tagccaattg gaattagttt agcctgtgcg gtctaaccct agccaatagg ggactgacac 300 
agcagcaggg gccacatgtg tcaggaataa gacccccttc ccctccctgt ccagatgtgt 360 
gctcaccatt gctccatctg tgagggcaca cccttctata gaagtaaatt gccttgctga 420 
gaagaaaaaa aagaacattt tatattcaag tcctatttct tttgctgcac cgaaacttta 480 
tttataaca 489 
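
The entries of the sequence listing each declare their length under <211> and carry running position counts at the end of every row, so a listing can be checked mechanically. The following is a minimal sketch of such a check, assuming the tag layout used above (<210> identifier, <211> length, <400> sequence); the function names and the two-entry sample are hypothetical and are not part of the application.

```python
import re

# IUPAC nucleotide codes expected in a listing (lowercase, as in the entries above).
IUPAC = set("acgturymkswbdhvn")


def parse_listing(text):
    """Return a list of (seq_id, declared_length, residues) tuples."""
    entries = []
    # Each entry opens with a <210> tag, so split just before every occurrence.
    for block in re.split(r"(?=<210>)", text):
        seq_id = re.search(r"<210>\s*(\d+)", block)
        length = re.search(r"<211>\s*(\d+)", block)
        body = re.search(r"<400>\s*\d+(.*)", block, re.S)
        if not (seq_id and length and body):
            continue
        # Keep letters only; this drops spaces, line breaks and position counts.
        residues = re.sub(r"[^A-Za-z]", "", body.group(1)).lower()
        entries.append((int(seq_id.group(1)), int(length.group(1)), residues))
    return entries


def check_listing(text):
    """Return human-readable problems found in the listing text."""
    problems = []
    for seq_id, declared, residues in parse_listing(text):
        if len(residues) != declared:
            problems.append(
                f"SEQ {seq_id}: declared {declared}, counted {len(residues)}"
            )
        bad = sorted(set(residues) - IUPAC)
        if bad:
            problems.append(f"SEQ {seq_id}: unexpected symbols {bad}")
    return problems


# Hypothetical two-entry sample: entry 1 is consistent, entry 2 contains a
# misread character and is one residue short of its declared length.
sample = """<210> 1
<211> 24
<212> DNA
<213> Artificial sequence
<400> 1
acgtacgtac gtacgtacgt 20
acgt 24
<210> 2
<211> 12
<212> DNA
<213> Artificial sequence
<400> 2
acgtacgLac g 11
"""

for problem in check_listing(sample):
    print(problem)
```

A check of this kind only flags candidates for review. A symbol outside the nucleotide alphabet or a length mismatch indicates a transcription problem, but it does not reveal which residue was intended.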
Retroviral expression vectors on the basis of HERV LTR
sequences 



The present invention relates to retroviral expression
vectors bearing promoters which can be controlled in a
cell-specific manner. The vectors are useful, for example, for
the cell-specific expression of genes of therapeutic value in
the context of gene therapy.



Retroviruses are RNA viruses wherein the viral genes are 
encoded by a single-stranded RNA molecule. Following entry of 
the viruses into the cell, the viral RNA is converted into a 
double-stranded DNA molecule by means of reverse 
transcription. The DNA enters the nucleus and integrates into 
the cellular chromosome. The integrated form of viral DNA, 
the so-called provirus, represents the template for the 
expression of viral genes. 



Integration of the viral genome into the cellular chromosome 
is an obligatory step of viral replication and is mediated by 
virus-encoded enzymes. With few exceptions, the viability of
the infected cell appears to be affected little or not at all
by the presence of the retroviral genome in the cell, the
expression of its genes, and the formation of viral
particles.

Retroviral gene transfer is used for the introduction of
functional genes, in particular genes of therapeutic value,
into cells without affecting the ability of the host cell to
proliferate. Due to their mode of replication, retroviruses
are suitable for such gene transfer. In the simplest
embodiment, at least a portion of the viral genes is replaced
by a gene of interest and, by using the efficient viral
infection process, this gene of interest is transferred into
the target cell.

Retroviral vectors are suitable for gene therapy because the 
infection by retroviruses occurs with high efficiency and the 
retroviral vectors may be modified to incorporate 
heterologous DNA and may stably integrate into the genome of 
the host cell. A large number of retroviral vectors have been
developed in recent years; by way of example, reference is
made herein to the reviews by Günzburg et al. (1996) and
Robbins et al. (1998).

One preferred embodiment of retroviral vectors is the
so-called ProCon vector, first described in WO 96/07748.
Reference is made to that document in its entirety for the
disclosure.

ProCon vectors have heterologous promoter elements and 
optionally other regulatory elements in their 3' LTR which 
following infection are duplicated and translocated to the 5' 
LTR in the target cell and which are capable of controlling 
the expression of marker genes or therapeutic genes. These 
heterologous genes are not directly linked to the promoter 
but are inserted inside the vector. 

ProCon vectors comprise a 5' LTR portion having the
structure U3, R, U5, and at least one coding and/or
non-coding sequence, as well as a 3' LTR region comprising a U3
portion which is completely or partially deleted, wherein the
deleted U3 portion has been replaced by a polylinker sequence
followed by the R and U5 portions.
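
The duplication of the 3' LTR promoter described above can be sketched as a small model. This is a simplified illustration and not the procedure of WO 96/07748: the class and function names and the element labels are hypothetical, and the rule applied (the U3 of both proviral LTRs derives from the 3' LTR of the vector, the U5 from the 5' LTR) is the general outcome of retroviral reverse transcription.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LTR:
    """One long terminal repeat, written as its three portions."""
    u3: str
    r: str
    u5: str


@dataclass(frozen=True)
class Vector:
    """A vector reduced to its two LTRs and the sequence between them."""
    ltr5: LTR
    body: str
    ltr3: LTR


def promoter_conversion(plasmid: Vector) -> Vector:
    """Model the provirus obtained after one round of infection.

    Reverse transcription copies U3 from the 3' end and U5 from the 5' end
    of the vector, so both proviral LTRs end up identical.
    """
    converted = LTR(u3=plasmid.ltr3.u3, r=plasmid.ltr5.r, u5=plasmid.ltr5.u5)
    return Vector(ltr5=converted, body=plasmid.body, ltr3=converted)


# Hypothetical labels: the 3' U3 has been replaced by a heterologous promoter
# inserted via the polylinker; the 5' LTR still carries the original U3.
plasmid = Vector(
    ltr5=LTR(u3="original_U3", r="R", u5="U5"),
    body="gene_of_interest",
    ltr3=LTR(u3="heterologous_promoter", r="R", u5="U5"),
)

provirus = promoter_conversion(plasmid)
print(provirus.ltr5.u3)  # promoter now positioned upstream of the gene
print(provirus.ltr3.u3)
```

In this model the gene of interest sits inside the vector and only comes under the control of the heterologous promoter in the infected target cell, once that promoter has been copied into the 5' LTR.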

Propagation of these vectors is performed by means of a
helper cell line producing high amounts of the viral proteins
which are no longer synthesized by the expression vector
itself. However, the helper cell line is no longer capable of
producing a replication-competent virus. This cell line is
also referred to as a packaging cell line and comprises a cell
line transfected with at least a second plasmid carrying the
genes which enable the packaging of the modified retroviral
vector. In this respect, reference is made to WO 92/10564,
which is incorporated herein by reference in its entirety.

The DNA encoding the modified retrovirus (expression vector) 
is transfected into the packaging cell line. Under these 
conditions the modified retroviral genome comprising the 
inserted therapeutic genes or marker genes, respectively, is 
transcribed and packaged into retroviral particles
(recombinant viral particles). This recombinant virus is then
used for the infection of target cells; the genome of the
modified retrovirus, i.e. the expression vector, is 
integrated into the target cell genome, wherein this 
integration occurs together with the marker genes or 
therapeutic genes, respectively. A cell infected with the 
recombinant viral particles generated in this manner is 
unable to produce new vector virus since no other viral 
proteins are present in these cells. The DNA of the
expression vector, containing the genes of therapeutic value
or marker genes, respectively, remains integrated in the
cellular DNA of the host cell and may subsequently be
expressed in the cell.

Preferably, genes of therapeutic value which are expressed
by means of such retroviral expression vectors are to be
expressed in a cell- and tissue-specific manner. For this
purpose, cell-specific regulatory sequences
are introduced into the LTR sequence of the expression 
vector. For example, these cell-specific regulatory sequences 
comprise cell-specifically controllable promoter regions, 
cell-specific enhancer sequences as well as binding sites for 
transcription factors. The promoters are localized in the U3 
portion of the LTR. 

Presently, cellular promoter sequences or promoter sequences
of exogenous retroviruses are inserted into retroviral
vectors.

Cellular promoters often require additional signal structures 
which may be present at a great distance upstream or 
downstream of the promoter. Therefore, it has always been 
found difficult to isolate strong, tissue-specific, cellular 
promoter sequences and to clone them into retroviral vectors. 
Promoters of exogenous retroviruses have the advantage that
they contain, within the retroviral LTR, all the necessary
regulatory elements in a confined region and, therefore, may
be transcribed essentially independently of neighboring DNA
sequences at the site of integration. However, a severe
disadvantage is that, although they are strong, they are not
tissue-specific and generally are expressed with equal
strength in all cell types.

Therefore, it is an object of the present invention to
provide novel retroviral expression vectors which utilize the 
benefits of retroviral promoters to concentrate all signal 
structures required for transcription in a confined region 
within the U3 and R regions, but simultaneously avoid the 
disadvantages associated therewith. 

According to the present invention, this object has been
achieved by inserting into retroviral expression vectors a
cell-specifically controllable promoter portion derived from
a human endogenous retroviral DNA nucleotide sequence (HERV).
These promoter sequences of human endogenous retroviruses
are already present in the host cell, and it has been
found according to the invention that they are well
suited for regulation of the cell-specific expression of
marker genes and genes of therapeutic value.

Endogenous retroviruses (ERV) may be found in the genome of 
all cells in an organism. They are transferred vertically via 
the germ line and may be reactivated by conditions caused by 
the environment. About 2% of the human genome consists of
endogenous retroviruses and retroviral sequences, solitary
HERV LTRs being present in an amount of 20,000-40,000 copies
per genome (Tab. 1) (Leib-Mösch et al., 1993; Wilkinson et
al., 1994; Patience et al., 1997).

Since HERV sequences were integrated into the primate
genome as long as 30-40 million years ago, it may be assumed
that in the course of evolution most of the pathogenic
sequences were eliminated from the provirus by mutations and
rearrangements or have been modified so as to be no
longer disadvantageous for the organism. Compared to vectors
derived from animal viruses, retroviral vectors constructed
from such sequences have the advantage that no new viral
sequences must be introduced into the genome. In addition,
recombination with HERV sequences already present in the
genome cannot give rise to novel retroviruses, as might be the
case if retroviruses of other species were used as vectors.
For this reason, the use of these sequences in the
construction of retroviral vectors can minimize the safety
risk. Furthermore, homologous regions contained in the genome
may be utilized for a tissue-specific integration of the
retroviral vectors into specific sites of a chromosome.

In the course of evolution, HERV elements have adopted a 
number of cellular functions. For example, promoter and 
enhancer elements of HERV LTRs are used for the 
transcriptional control of cellular genes (Kato et al.,
Feuchter-Murthy et al., 1993; Di Christofano et al., 1995).
One example of the use of LTR regulatory elements for
tissue-specific expression of a cellular gene is the human
amylase gene. This gene is controlled by the LTR of an HERV-E
element, which restricts its expression specifically to the
salivary glands (Ting et al., 1992).
Moreover, Schulte and co-workers have shown that the
insertion of an endogenous retrovirus into the 5'
untranslated region of the pleiotrophin gene is responsible
for the trophoblast-specific activity thereof (Schulte et
al., 1996). In other instances, polyA signals of HERV LTRs
may also serve to polyadenylate cellular transcripts (Mager,
1989; Goodchild et al., 1992).

Since retroviruses must maintain their transcriptional
activity largely independently of the regions surrounding
their site of integration, the main advantage of the use of
retroviral promoters resides in the fact that all signal
structures required for transcription are localized in a
confined region within the U3 region and the R region of the
LTR, as described above. Because these HERV promoters have
persisted in the primate genome for millions of years, they
have adapted during evolution to be active in a cell
type-specific manner similar to cellular promoters and thus
combine the advantages of cellular and retroviral promoters.

HERVs are transcribed starting from a classical RNA
polymerase II promoter (Wilkinson et al., 1994). This
promoter is localized within the LTR region. Therefore, the
HERV transcript does not comprise a complete copy of the
provirus. To compensate for the loss of transcription control
elements, these elements have developed the mechanism of
reverse transcription, by which the sequences lost at both
ends of the elements are regenerated, which in turn
regenerates the LTRs. Besides promoter sequences, the HERV
LTRs also contain a plurality of different binding sites for
transcription factors (Seifarth et al., 1998) responsible for
the tissue-specificity of expression.

Although there is a certain structural similarity between
HERVs and exogenous animal retroviruses, such as MLV or MMTV,
HERV sequences, and in particular HERV promoters, have never
been considered as possible candidates for the development of
retroviral expression vectors. On the contrary, up to now they
have only been considered as disruptive factors with regard to
gene therapy (Patience et al., 1997). Because of sequence
homologies, there has been concern that they might interfere
with the therapeutic vector by recombination in the target
cell. Although it has not been possible up to now to confirm
these concerns experimentally, a problem arose in the
development of very efficient human packaging cell lines,
namely that co-packaging and inadvertent transfer of
potentially infectious HERV sequences could occur. Therefore,
a detailed study of the possible packaging of expressed HERV
sequences into virions based on MLV has been conducted.
Patience et al. (1998) identified mRNA transcripts of several
different HERV families, such as HERV-K and HERV-H, in human
packaging cell lines. Even by using a highly sensitive RT-PCR
test, however, none of these sequences could be detected in
the MLV vector particles released by the cells.

According to these findings, packaging and transfer of HERV
sequences and therefore, eventually, also of HERV-based
vectors in MLV packaging systems seemed to be out of the
question. Although HERV genes have a sequence homology of
50-65% with respect to MLV genes, the regions which are
essential for packaging and infection, in particular the
packaging signal localized between the 5' LTR and the gag
region as well as the LTR itself, have no detectable sequence
homology with the corresponding MLV sequences. Moreover,
since no cell lines are known to date which produce HERV
particles in sufficient amounts, efficient HERV packaging
systems are presently also not conceivable.

Thus, the present invention solves the problem of retroviral 
expression vectors controlling the cell- and tissue-specific 
expression of foreign genes (gene of interest) by providing 
expression vectors containing, in functional assembly, at 
least the following elements: 

a) DNA sequences for packaging of the vector RNA and for the 
cell-specific expression of proteins or peptides encoded by 
heterologous DNA nucleotide sequences; 

b) one or more heterologous DNA nucleotide sequences 
(transcription unit) encoding a protein or peptide;
wherein the DNA sequences for cell-specific expression are 
characterized by comprising a cell-specifically controllable 
promoter region derived from a human endogenous retroviral 
virus, in particular from the LTR sequence of said virus. 

The promoter region of the HERV sequence may comprise the
whole LTR region of the HERV. However, in another embodiment 
of the present invention, the promoter region only comprises 
the U3 region or the R-U3 region of the HERV LTR. In another 
preferred embodiment of the present invention, besides these 
regions the promoter region also comprises the untranslated 
region between the 5' LTR and the gag genes. It has been 
found according to the present invention that this region 
also contains sequences which control the cell-specific 
expression of proteins or peptides, respectively, i.e. which 
at least contribute to said cell-specific expression. 

The promoters are partial regions of DNA required for the 
start of transcription of the corresponding structural genes. 
The promoter includes the transcription start site, the 
recognition and binding site for RNA polymerase. The promoter 
may also comprise other sequences to which regulatory 
proteins may bind and which thereby specifically control the 
initiation of transcription. Examples of such proteins are 
transcription factors and repressors. Examples of said 
regulatory elements of the transcription activity are the 
CAAT box, GC box and TATA box. The promoters are recognized
by RNA polymerase II.
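
As a rough illustration of how promoter elements such as the CAAT box, GC box and TATA box can be located in a sequence, the following Python sketch scans both strands for simplified core motifs. The motif strings (TATAAA, CCAAT, GGGCGG) are assumed textbook consensus cores and the example sequence is invented; neither is taken from the present disclosure. Real binding sites are degenerate, so this is only a sketch and not a substitute for a proper binding-site model.

```python
# Simplified core motifs (assumed textbook consensus strings, not from the source).
CORE_MOTIFS = {"TATA box": "TATAAA", "CAAT box": "CCAAT", "GC box": "GGGCGG"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string made of A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


def scan_promoter_elements(seq, motifs=CORE_MOTIFS):
    """Return sorted (start, strand, name) hits.

    start is the 0-based index in seq where the match begins; strand "-"
    means the motif was found as its reverse complement.
    """
    seq = seq.upper()
    hits = []
    for name, motif in motifs.items():
        for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
            pos = seq.find(pattern)
            while pos != -1:
                hits.append((pos, strand, name))
                pos = seq.find(pattern, pos + 1)
    return sorted(hits)


# Hypothetical test sequence containing one copy of each core motif.
example = "GGGCGGAACCAATCGTATAAAGC"
for pos, strand, name in scan_promoter_elements(example):
    print(pos, strand, name)
```

Exact string matching keeps the sketch short; a scan of real LTR sequences would normally tolerate mismatches.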

The promoter regions for the cell-specific expression of 
foreign proteins from HERVs may optionally be combined with 
other sequences derived from exogenous retroviruses which
promote cell-specific expression. In addition, a combination
with regulatory sequences from cellular genes to support
cell-specific expression is contemplated.

Furthermore, the retroviral expression vector according to
the present invention at least contains DNA sequences for 
packaging of the vector by means of a packaging helper cell 
line. The DNA sequences for packaging are localized between 
the 5' LTR and the gag gene. Such packaging signals are 
present in any retroviral vector and therefore known to the 
skilled artisan. Examples of packaging signals are listed in 
Mann et al., 1985, and Rein, 1994, as well as the literature
cited therein. This literature is incorporated by reference 
in its entirety. 

The retroviral expression vector according to the present 
invention contains one or more transcriptional units encoding 
an amino acid sequence. The amino acid sequence refers to a 
protein or peptide. Any sequence encoding a protein or 
peptide of interest may be inserted into the expression 
vector. For example, such proteins or peptides may be encoded 
by marker genes, genes of therapeutic value, genes with 
antiviral function, anti-tumor genes and/or cytokine genes.
This list could be extended indefinitely. The genes which
can be introduced into the retroviral expression vector are 
known to those skilled in the art. The type of genes inserted 
depends on the intended use of the vector according to the 
present invention. 

For example, the vectors according to the present invention 
may be employed for gene therapy to transfer heterologous DNA 
into target cells in order to render diseases accessible to a
specific therapy. The vector DNA is introduced into the
selected target cell so that the heterologous DNA is
expressed in the target cell and the product encoded by the
DNA is produced. This includes in particular genes for
the expression of proteins which are not produced, are no
longer produced, or are not produced in sufficient amounts by
the target cell so that a disease condition develops. The
invention comprises not only proteins or peptides
which occur naturally but also those which have
been modified in a manner to achieve a desired effect, for
example a higher enzyme activity, blocking of a binding site
for viruses, destruction of tumor cells by suicide genes,
etc.

Generally, the DNA nucleotide sequence encoding a protein or
a peptide is heterologous DNA encoding RNA and proteins which
the cell in which the proteins or peptides, respectively, are
expressed usually does not produce in vivo. It may also be
referred to as foreign DNA. This includes any protein, such 
as enzymes, hormones, and antibodies. Therefore, the 
retroviral expression vectors provided by the present 
invention are designed to express proteins of interest in 
human cells. 

The promoter regions employed according to the present 
invention are selected from HERV sequences derived from the 
HERV families known. Examples of these are HERV-K, HERV-H, 
HERV-E, HERV-L, HERV-T, HERV-R, HERV-I, HERV-P, ERV9, HERV-W. 

It should be understood that it is also possible to screen 
other, presently unknown HERV families in order to find 
promoter sequences which are presently unknown and which 
regulate the cell-specific expression.

Preferred LTR sequences from HERVs according to the present 
invention which may be employed for the tissue-specific 
expression of proteins and peptides are disclosed in the 
annex. They may be introduced into retroviral expression
vectors to achieve the object according to the present
invention. It should be understood, however, that by means of 
methods known per se it is also possible to select only 
portions of these LTRs to keep the sequences inserted into 
the vector as small as possible. Useful fragments may be 
selected using various deletion mutants. Furthermore, also 
other variations of these LTR sequences are possible, e.g. 
point mutations, insertions, additions, substitution of 
several nucleotides, etc. in order to increase the efficiency 
of the tissue-specific expression and to adapt it to the 
desired function. 

In a preferred embodiment according to the present invention 
the ProCon vectors described at the beginning are employed. 
Such ProCon vectors comprise a 5' LTR region having the
structure U3-R-U5, one or more sequences encoding a protein
or peptide and optionally non-coding sequences as well as a
3' LTR portion comprising a partially or completely deleted
U3 region wherein the deleted U3 portion at least comprises
the HERV LTR sequences employed according to the present
invention, followed by an R-U5 region. Further details are
described for example in WO96/07748 and WO96/28564. These
documents are included herein by reference in their entirety. 

According to the present invention a strategy has been 
developed to track down promoter sequences having a cell- 
specific function. This strategy is described in more detail 
in the following description. It has to be understood that
in principle other methods for finding HERV LTR
sequences which act in a cell-specific manner may also be
considered and used. Thus, the present invention is not
limited to the following examples.

The retroviral expression vectors according to the present
invention are packaging deficient, i.e. are unable to produce 
viral particles without assistance by a packaging helper cell 
line. Therefore, the present invention also comprises a 
retroviral vector system containing a retroviral expression 
vector as described in the present invention and a packaging 
cell line which contains at least a retroviral or recombinant 
retroviral construct encoding the packaging proteins of the 
retroviral expression vector. Such packaging cell lines are 
known per se and have been described. By way of example, 
reference is made herein to the murine packaging cell line 
PA317 (Sailer et al., 1998).

In the following, the invention will be described in general,
followed by a description with respect to Examples.

According to the present invention, the applicability of 
human endogenous retroviruses for the development of tissue- 
specific vectors for gene therapy has been investigated. For 
this purpose, first the tissue-specificity of HERV pol 
transcription has been examined in different cell lines, such 
as T cells, keratinocytes and breast cancer cells using a 
"reverse dot blot" procedure. In this test, the expression 
pattern of the various HERV families was found to be 
generally cell type dependent. To isolate HERV LTRs with 
transcriptional activity from different cell lines and 
tissues, primers were developed which could be used for the 
specific amplification of the U3/R regions from mRNA 
preparations. The isolated LTR sequences as well as
individual members of already known LTRs were inserted into
expression vectors. Following transient transfection of the
reporter plasmids, the activity of the LTR promoters was
tested in the different cell lines via the luciferase
activity or eGFP fluorescence, respectively. The promoter
activity of individual HERV LTRs was found to vary clearly
depending on the cell line tested. The promoter region of a
HERV-H LTR isolated from astrocytes and liver cells, which was
found in several tests to be especially active in lung
fibroblast cells (LC5), was inserted into two retroviral
promoter conversion vectors (pLESN and pLX). These constructs
were tested in packaging cell lines, the packaging efficiency
was evaluated, and after infection the target cells were
tested for the occurrence of a promoter conversion. FACS
analyses were performed to detect the transcriptional
activity in the target cells.

Thus, a method has been described by which HERV promoter 
sequences (U3/R region) mediating a tissue-specific
expression may be identified and isolated. Subsequently, the
tissue-specificity and promoter activity of these sequences
were tested in a transient transfection assay in various human
cell lines. Eventually, suitable sequences were chosen, 
cloned into a promoter conversion vector (ProCon vector) 
whereby their usefulness for the construction of tissue- 
specific vectors for gene therapy was examined. The 
preparation of the retroviral expression vectors according to 
the present invention is carried out using recombinant 
techniques known per se. Such techniques are for example 
described in Sambrook et al., 1989, and Perbal, 1984. For the
construction of the ProCon vectors see WO 96/07748 already
mentioned at the beginning and the related literature.

3. Results 



3.1 Analysis of HERV transcription in different cell types

To investigate HERV transcription in different cells, a method
(reverse dot blot hybridization) was employed in the first
step which had originally been developed for the detection of
HERV expression in peripheral blood mononuclear cells
(Herrmann and Kalden, 1994). For this method, cloned and
characterized HERV pol gene fragments from human genomic DNA
were immobilized on a membrane and hybridized with
radiolabeled HERV pol gene probes. The probes were amplified
from mRNA of different cells using RT-PCR and degenerate
oligonucleotides homologous to a highly conserved region of
retroviral pol genes (Shih et al., 1989; Donehower et al.,
1990). Using this method we obtained a characteristic
hybridization pattern with every cell line examined so far,
which was the first result to indicate a tissue-specific
expression of HERV elements.
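
Degenerate oligonucleotides are commonly written with IUPAC ambiguity codes, where one letter stands for several possible bases. The Python sketch below expands such a primer into all unambiguous sequences it represents. The primer string is primer 35 (HERV-K) as printed in Tab. 2; the function name and the use of the standard IUPAC code table are illustrative choices and are not part of the described method.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer):
    """Return every unambiguous sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Primer 35 (HERV-K) as listed in Tab. 2; M = A or C, Y = C or T.
primer_35 = "GTTCCMTYAGTATTTATTGATC"
for variant in expand_degenerate(primer_35):
    print(variant)
```

The number of variants is the product of the number of bases allowed at each position, so it grows quickly with every additional ambiguity code.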

3.2 Isolation of LTR U3 regions of expressed HERVs 

The tissue-specific expression of a retrovirus is primarily 
defined by its U3 region. In this region, all regulatory 
sequences are localized, such as promoter, enhancer, and 
binding sites for various cellular transcription factors. For 
this reason, primers were developed which could be used for 
the specific isolation by means of RT-PCR of these HERV
sequences from the mRNA of different cell lines (Tab. 2; Fig.
1). In this manner, about 30 different HERV LTRs were cloned.
Some of these sequences were tested in a reporter plasmid
for their promoter activity and tissue-specificity. 

In a first approach, for the PCR a polydT primer was combined
with a primer complementary to the polypurine tract (PPT)
of retroviral RNA (Fig. 1). The PPT is a conserved
portion in the non-translated region between the env gene and
the U3 region of the 3' LTR. During reverse transcription of
the retrovirus, the PPT region is used as a primer binding 
site for plus strand synthesis (Sorge and Hughes, 1982). 

By means of data base analyses, the PPT sequences of 
different HERV families were identified and classified into 
different groups by comparing their homology. From the 
consensus sequences of individual groups oligonucleotides 
were synthesized as primers for RT-PCR. The mRNA was prepared
from different cell lines: epithelial cells (HeLa, HaCaT),
fibroblast cells (LC5), T cells (H9, HUT78), lymphoblasts
(CML), glioma cells (85HG66, U373), pancreatic cells
(MiaPaCa2, Panc1), liver cells (Chang Liver), and breast
cancer cell lines (T47-D, MCF7). Moreover, cDNA libraries
(Clontech) of various human tissues (brain, heart, liver,
kidney, lung, pancreas, placenta, skeletal muscle) were also
employed in the RT-PCR.
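
The derivation of a primer from the consensus of a homology group, as mentioned at the start of the preceding paragraph, can be sketched in Python as follows. The aligned sequences are invented placeholders and not actual HERV PPT sequences, and the rule used here (column-wise majority vote, with an IUPAC code for two-base ties) is an assumption; the text does not state how the consensus sequences were derived.

```python
from collections import Counter

# IUPAC codes for two-base ambiguities (standard nomenclature).
IUPAC_PAIRS = {
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S",
    frozenset("AT"): "W", frozenset("GT"): "K", frozenset("AC"): "M",
}


def consensus(aligned):
    """Column-wise majority consensus of equal-length, pre-aligned sequences."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    out = []
    for column in zip(*aligned):
        counts = Counter(column).most_common()
        best = counts[0][1]
        top = [base for base, n in counts if n == best]
        if len(top) == 1:
            out.append(top[0])          # clear majority base
        elif len(top) == 2:
            out.append(IUPAC_PAIRS.get(frozenset(top), "N"))  # two-way tie
        else:
            out.append("N")             # no usable majority
    return "".join(out)


# Hypothetical homology group (placeholder sequences, for illustration only).
group = ["AAAGGGGGAA", "AAAGGGGGAG", "AAAAGGGGAA", "AAAGGGGGAG"]
print(consensus(group))
```

Positions that come out as ambiguity codes are the ones where a degenerate primer would be needed to cover the whole group.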

Subsequently, the fragments obtained were cloned, sequenced, 
and analyzed by means of data base comparison. Among the PCR
fragments obtained with PPT and polydT primers, two LTRs were
assigned to the families of HERV-H and HERV-K on the basis of
homology comparisons. The use of polydT primers in these
PCR samples amplified numerous sequences which did not
reveal any homologies to known retroviral LTRs and moreover
did not contain any promoter structural elements. For this
reason, other sequences were selected for primer synthesis
from conserved regions of the U3 region and from the R region
of the HERV-K and HERV-H families (Mold et al., 1997) (Fig.
1, Tab. 1). The resulting PCR products were separated on an
agarose gel, followed by transfer to nitrocellulose filters
and hybridization with probes prepared from the LTR regions
of different HERVs (HERV-K-pl 1 67, HERV-H-H6, HERV-E,
HERV-L). Afterwards, the hybridizing fragments were cloned
into a vector (pZERO, Invitrogen) and sequenced. Using this 
method, several HERV LTRs could be isolated which are listed 
in Table 3. 

The HERV-K LTRs isolated from human brain and heart tissue as 
well as from T47-D cells show very strong sequence homologies 
to the 3' LTR of HERV-K10. In contrast, the HERV-H LTRs
exhibited much higher sequence variations. HERV-H31, HERV-H3,
HERV-HCM1, HERV-HCM4, HERV-HMP23 are homologous to the
HERV-H-H6 LTR isolated by Mager et al.; the other HERV-H sequences
show homologies to the HERV-H LTRs from vervet monkey,
marmoset and man isolated by Anderssen et al. (1997). The
HERV-W LTRs isolated from T47-D cells are related to the LTR
of clone CL6 (Komurian-Pradel, 1989).

3.3 Analysis of the expression of HERV promoters in a 
transient luciferase assay 

For an analysis of the promoter activity and tissue- 
specificity of the isolated HERV LTRs, these were first 
cloned into a luciferase reporter plasmid (pBL, Butz, K., 
DKFZ, Heidelberg). This vector contains the luciferase gene
of Photinus pyralis fused to the SV40 polyA signal of pBLCAT2
(Hoppe-Seyler et al., 1991).

The individual vector constructs were transiently transfected 
into different cell lines. After 48 h, the luciferase 
activity from the cell lysate was measured using the 
luciferase assay kit from Promega and was determined as
the relative luciferase activity after standardization for
β-galactosidase activity or Renilla luciferase activity,
respectively. The LTR promoter activities were determined in
epithelial cells (HeLa, HaCaT), fibroblast cells (LC5), T
cells (H9, HUT78), glioma cells (85HG66, U373), liver cells
(Chang Liver), pancreatic cells (MiaPaCa2, Panc1), and breast
cancer cell lines (T47-D, MCF7).
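
The standardization step can be illustrated with a small Python sketch. It assumes that the firefly luciferase signal is divided by the signal of the co-transfected control reporter (β-galactosidase or Renilla luciferase) and then expressed relative to a reference construct. The exact formula, the choice of reference construct, and all numbers below are assumptions made for illustration; they are not measured data and are not specified in the text.

```python
def normalized_activity(firefly, control):
    """Firefly luciferase signal corrected for transfection efficiency."""
    if control <= 0:
        raise ValueError("control reporter signal must be positive")
    return firefly / control


def relative_activity(sample, reference):
    """Normalized activity of a sample relative to a reference construct.

    Both arguments are (firefly_signal, control_signal) pairs.
    """
    return normalized_activity(*sample) / normalized_activity(*reference)


# Hypothetical readings in arbitrary light units (not experimental values).
ltr_construct = (1200.0, 40.0)
reference_construct = (150.0, 50.0)
print(relative_activity(ltr_construct, reference_construct))
```

Dividing by the control reporter removes differences in transfection efficiency between wells, so that values from different cell lines can be compared.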

The results are presented in Figures 2a-2f. According to
these results, among all endogenous LTRs tested the HERV-H-H6
LTR has the strongest promoter. The HERV-K LTR from placenta
is particularly active in HeLa cells. In all other cell
lines, this LTR exhibits only a very weak activity. HERV-K-T47-D
also showed a strong activity in HeLa cells; this LTR
was also active in HaCaT cells and pancreatic cells. The
HERV-L LTR has a strong promoter activity in liver cells and 
a weak activity in T cells and pancreatic cells. The HERV-T- 
S71A and HERV-E LTRs were active in none of the cell lines 
tested. Also, no activity at all of a HERV LTR could be 
observed up to now in CML cells. 

Almost all of the cloned HERV-H LTRs (HERV-H1, HERV-H8,
HERV-H13, HERV-H19, HERV-H-H6, Tab. 3) were active in 85HG66
cells, with HERV-H1 and HERV-H8 showing the highest activity in
this cell line (not shown). HERV-H19 was very active in HeLa
cells. The HERV-HCM1 LTR exhibited the highest promoter
activity in all cell lines and was especially active in lung
fibroblasts (LC5) (Fig. 3).

3.4 Construction of HERV hybrid vectors and monitoring of the 
activities of HERV promoters in these vectors 

The functionality of human endogenous retroviral LTR 
sequences in retroviral vectors was tested in two different 
promoter conversion vectors (ProCon). For this purpose,
hybrid HERV/MLV vectors were constructed using two vectors,
pLESN-MMTV (Fig. 7) and pLX-MMTV (Fig. 8), on the basis of
MLV. These vectors include the EGFP gene as a reporter gene
which is expressed from the 5' LTR (in varying amounts
depending on whether measured before or after the promoter
conversion) as well as a neomycin resistance gene which is
expressed from an SV40 promoter. Moreover, vector pLX-MMTV contains a
prokaryotic origin of replication enabling recloning of the 
provirus for further molecular characterizations. 

To construct the HERV hybrid vectors, in each case the MMTV 
LTR was replaced by the HERV-HCM1 LTR (Fig. 7). For this
purpose, the LTR was first amplified by means of PCR from the
vector pBL-HERV-H using specific primers which contained
additional sequences for the restriction enzymes MluI and
SacII. Then, these fragments were inserted into the vectors
having their 3' U3 deleted. After transfection into the
packaging cell line, the EGFP reporter gene is first
expressed from the MLV promoter (Fig. 9a). After infection of
the target cells and successful promoter conversion by 
reverse transcription in the target cells, the reporter gene 
is present under the transcriptional control of the HERV LTR. 

The HERV hybrid vector constructs pLESN-HERV-H (Fig. 7) and 
pLX-HERV-H (Fig. 8) as well as the parent vectors pLESN-MMTV 
and pLX-MMTV were transfected into the amphotropic packaging
cell line PA317. Afterwards, the resulting retroviral vector
particles were used for the infection of cell lines CrfK and
LC5.

The infected cell lines were cloned and the selected cellular 
clones were examined for the presence of vector constructs 
and for the occurrence of promoter conversion. For this 
purpose, chromosomal DNA was prepared from infected and
uninfected cells and analyzed by means of PCR. The primers
were selected from the MLV U3 (P5) and R (P2) region as well
as the HERV-H region (P1) and used for the PCR in combination
with a primer for the EGFP region (Fig. 9a). The PCR products
were hybridized with HERV-H-specific probes (Fig. 9b). After
amplification with the primers P1 and P3, DNA of cells infected
with pLX HERV-H particles yielded a PCR product of 1.1 kb which
hybridized to the HERV-H probe. Amplification using MLV
U3-specific primers (P2/P3) with DNA of cells infected with pLX
and pLX HERV-H gave PCR products having a size of about 900
bp which showed no hybridization to the HERV-H probe. No PCR
product hybridizing to the HERV-H probe was obtained from the
amplification using MLV R primers (P5/P3). These results show
that promoter conversion has occurred and that the MLV
promoter of the 5' LTR had been replaced by the HERV
promoter.

After integration into the target cell DNA, the HERV LTR 
promoter activity in the retroviral vectors was determined by 
FACS analyses via the measurement of the EGFP fluorescence
(Fig. 10). For this purpose, the activity of the starting
vector pLX-MMTV was compared with that of the HERV vector
pLX-HERV-H (H6) prior to and after induction with
dexamethasone. The vector containing the MMTV LTR may be
activated by dexamethasone. The vector containing the HERV
LTR is not activated by dexamethasone; however, its activity
is higher by a factor of 10 than that of the
dexamethasone-stimulated MMTV hybrid vector.



3.5 Effect of regulatory elements in the R and U5 regions on 
the promoter activity of HERV sequences

In order to examine which sequence region is required for a
functional HERV promoter, the effect of additional LTR 
sequences localized outside of the U3 region in the LTR was 
investigated in several examples. For this purpose, the 
activity in the luciferase assay of the U3 region of 7 HERV-K 
LTRs (HERV-K-T47D, L5, L50, L8, L9, L48, and L20/49) was 
compared to the activity of the corresponding U3-R fragments. 
It was surprisingly found that the different R regions are 
able to affect the promoter in the U3 region in very
different ways. In the group 1 LTRs (L5, L50, L8, L9) the
presence of R sequences resulted in a marked increase in
promoter activity in all cell lines tested (Fig. 4a). In
contrast, in the group 2 LTRs (L20/L49) the HERV promoter
activity is reduced by the R region (Fig. 4b). The
HERV-K-T47D promoter (Fig. 5) and the L48 promoter (not shown) are
substantially unaffected by the respective R sequences.
Interestingly, in the case of the HERV-K-T47D LTR, sequence
regions localized downstream of the U3-R region and
comprising the U5 region as well as the 3' non-translated
region and the start of the gag gene have a clearly
activating effect (Fig. 5).

A sequence analysis of the different R regions tested 
revealed that group 1 LTRs have a binding site for 
transcription factor SP1 in the R region which is missing
from the R region in group 2 LTRs (Fig. 6). In contrast, the
group 2 R region contains a potential binding site for factor
TFS3 which acts as a repressor of transcription. This shows
that the activity of HERV promoters may be modified by 
insertion of additional regulatory elements such as 
transcription factor binding sites, enhancer sequences, or 
negative-regulatory elements.

Amended Claims

1. Retroviral expression vector containing at least the
following elements in functional assembly:

a) DNA sequences for packaging of the vector RNA and for
cell-specific expression of proteins or peptides encoded
by heterologous DNA nucleotide sequences;

b) one or more DNA nucleotide sequences encoding a
protein or peptide

wherein said DNA sequences for the cell-specific
expression contain a cell-specifically controllable
promoter region from a human endogenous retroviral DNA
nucleotide sequence (HERV).

2. Expression vector according to claim 1 wherein said DNA
sequences for cell-specific expression are derived from
the LTR region and optionally from the non-translated
region between the 5' LTR and the gag region of HERVs.

3. Vector according to claim 1 or claim 2 wherein the whole
LTR region, the U3 region, or the R and U3 regions are
derived from a human endogenous retroviral nucleotide
sequence.

4. Vector according to one or more of the preceding claims
wherein said nucleotide sequences encoding one or more
proteins or peptides are selected from one or more
elements of the group consisting of marker genes,
therapeutic genes, antiviral genes, anti-tumor genes,
and cytokine genes.

5. Vector according to one or more of the preceding claims
wherein said cell-specifically controllable promoter
region is derived from the LTR region of a
cell-specifically expressed endogenous human retroviral
nucleotide sequence.

6. Vector according to one or more of the preceding claims
wherein said human endogenous retroviral
cell-specifically controllable promoter sequences are
selected from one or more promoter sequences of HERV
families of the group consisting of HERV-K, HERV-H,
HERV-E, HERV-L, HERV-T, HERV-R, HERV-I, HERV-P, ERV9,
HERV-W.

7. Vector according to one or more of the preceding claims
wherein said promoter region besides the TATA box
additionally comprises recognition and binding sites for
regulatory proteins.

8. Vector according to claim 7 wherein said recognition and
binding sites for regulatory proteins comprise the GC
box, the CAAT box, enhancer sequences and repressor
sequences as well as hormone responsive sequence motifs
and wherein, optionally, additional recognition and
binding sites for regulatory proteins from the LTR
region of exogenous retroviruses and/or from cellular
genes are comprised.

9. Vector according to one or more of the preceding claims
wherein said vector is a promoter conversion vector
comprising a 5' LTR portion having the structure
U3-R-U5, one or more sequences selected from coding and
non-coding sequences, and a 3' LTR portion comprising a U3
region which is partially or completely deleted wherein
the deleted U3 portion is replaced by a
cell-specifically controllable promoter region from a HERV
LTR sequence, followed by the R-U5 region.

10. The mRNA or RNA of a retroviral expression vector
according to one or more of the preceding claims.



11. Prokaryotic cell or eukaryotic cell containing a
retroviral expression vector according to one or more of
the preceding claims.

12. Eukaryotic cell containing a retroviral expression
vector according to one or more of the preceding claims
in an integrated form.

13. Use of a cell-specifically controllable promoter region
from a human endogenous retroviral DNA nucleotide
sequence for the regulation of the expression of foreign
genes in retroviral expression vectors, preferably in
ProCon vectors.

14. Use of an expression vector according to one or more of
the preceding claims for the expression of foreign genes
in gene therapy.

15. Virion containing a retroviral expression vector RNA
obtained by transcription of an expression vector DNA
according to one or more of the preceding claims.

16. Method for the preparation of a virion according to
claim 14 for the introduction of one or more nucleotide
sequences encoding a protein or peptide wherein said
retroviral expression vector according to one or more of
the preceding claims is introduced into a suitable
packaging cell line under such conditions that the
virion is formed and released by the packaging cell
line.

17. Method for the introduction of nucleotide sequences
encoding one or more proteins or peptides into a
eukaryotic cell wherein said cell is infected by a
virion as defined in claim 14 under such conditions that
the nucleotide sequences encoding the protein or peptide
are inserted into the chromosomal DNA of the eukaryotic
cell.

18. Method according to claim 17, wherein the eukaryotic
cell is a mammalian cell.

19. Process according to claim 18, wherein the mammalian
cell is a human cell.

20. Retroviral vector system comprising a retroviral
expression vector according to one or more of the
preceding claims and a packaging cell line comprising
at least one retroviral or recombinant retroviral
construct encoding for the packaging proteins of the
retroviral expression vector.

21. Retroviral vector system according to claim 20, wherein
the packaging cell line comprises retroviral or
recombinant retroviral constructs encoding for such
retroviral proteins which are not encoded by the
retroviral expression vector.

SUMMARY

The present invention relates to retroviral expression 
vectors with cell-specifically controllable promoters. For 
example, the vectors may be used for the cell-specific 
expression of genes of therapeutic value in the context of a 
gene therapy. 

The present invention describes retroviral expression vectors 
containing at least the following elements in functional 
assembly:

a) DNA sequences for the packaging of the vector RNA and for
cell-specific expression of proteins or peptides encoded by
heterologous DNA nucleotide sequences;

b) one or more DNA nucleotide sequences encoding a protein or 
peptide wherein said DNA sequences for the cell-specific 
expression thereof contain a cell-specifically controllable 
promoter region from a human endogenous retroviral DNA 
nucleotide sequence (HERV).

Tab. 2: Primers used for the amplification of different
HERV LTR regions

A: HERV LTRs from different cell lines and tissues

regions of transcribed HERV sequences

in various cell lines

Fig. 3a: Cell line plotted vs. relative promoter activity

Fig. 3b: Cell line plotted vs. relative promoter activity

Fig. 4: LTR R region-modulated promoter activity of
HERV-K-T47D-related LTRs

A: Activation, cell line plotted vs. relative 
promoter activity 

B: Inhibition, cell line plotted vs. relative 
promoter activity 

Fig. 5: Sequences downstream of LTR R modulate the promoter 
activity of HERV-K-T47D-related LTRs 
Cell line plotted vs. relative promoter activity 

Fig. 6: Regulatory elements in the R region of HERV-K T47D 
LTRs 

Fig. 7: Retroviral ProCon vectors pLESN-MMTV and
pLESN-HERV-H

Fig. 8: Retroviral ProCon vectors pLX-MMTV and pLX-HERV-H 

Fig. 9: a) Promoter conversion of ProCon hybrid vectors 
b) Detection of correct promoter conversion by 
means of PCR and hybridization using a HERV-H and a
psi probe 

Fig. 10: a) Organization of the two ProCon vectors pLX-MMTV 
and pLX-HERV-H 

b) Promoter activity of the HERV-H LTR as compared 
to the MMTV LTR following infection of CrfK cells 
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Tab. 1 : Human endogenous retroviral elements 









% of 




HERV family 


Copy number 


genome 


Class 1 HERVs 


HERV-ERI 




0.07 % 


(type C- related 


HERV-E (4-1, ERVA, NP-2) 


35 - 50 




HERVs) 


HERV-E LTR 


500 - 600 






51-1 


35 - 50 






ERV1 ' 


10-15 






HERV-R (ERV3) 


10 






RRHERV-I 


20 






HERV-T (S71 , CRTK1 , CRTK6) 


50 - 60 






HERV-T LTR 


150 - 200 






ERV-FRD 


8 






HERV-RW 




0.2% 




HERV-W (MSRV) 


25 - 50 






HERV-R (ERV9) 


30-40 






ERV9 LTR 


3000 - 4000 






HERV-P (HuERS-P, HuRRS-P) 


50-90 


0.01% 




HERV-IP 




0.01% 




HERV-I (RTVL-I) 


25 - 50 






HERV-IP-T47D (ERV-FTD) 


35 






HERV-IP LTR 


1800-2000 






HERV-HF 




0.2% 




HERV-H (RTVL-H, RGH) 


900 - 1000 






HERV-F 


16 






HERV-H-LTR 


1000 




Class II HERVs 


HERV-K 




0.5% 


(type A-, B-, and 


HERV-K(HML-I) 


10-20 




D-related 


HERV-K(HML-2) 


30 - 50 




HERVs) 


HERV-K10 








HERV-K-HTDV 








HERV-K-IDDM 








HERV-K(HML3) 


25 






HERV-K(HML-4) 


6 






HERV-K-T47D 








HERV-K(HML-6) 


30-40 






HERV-K-HML-6P 








HERV-K-HML-6.17 








HERV-K(HML-7) 


? 






HERV-K-NMW7 








HERV-K(HML-8) 


? 






HERV-K(HML-9) 


? 






HERV-K-NMW9 








HERV-K(HML-IO) 


10 - 50 






HERV-KC4 








HERV-K LTR 


10 000 - 25 000 




Foamy virus- 


HERV-L 


100 - 200 


0.02% 


related HERVs 
Tab. 2: Primers used for the amplification of different HERV-LTR regions

No.   Primer         Sequence

34    HERV-K         ATGGCGGTTTTGTCGAA
35    HERV-K         GTTCCMTYAGTATTTATTGATC
36    HERV-K         ATGGAGCATACAATCGGG
3     HERV-K         AAGAAAAGGGGGAAATGTGGG
11    HERVKC4        AAAGGGAGGGGGGCATG
12    HERV-KT47-D    TAAAAAGGGGGGAGATG
1     HERV-H         ATGTGAGCAACATGGCTGTTTATTTC
2     HERV-H         TGTCAGGCCTCTGAGCCCAA
39    HERV-H         GCCATCTCGAGTGTCAGSCCTCTGAGYCYARGC
37    HERV-H         TATCTTGAATTCGKGTGAGCAAYAARRCTTTA
31    polydT         TTTTTTTTTTTTTTTTT
17    HERV-E         AAAGGGGGGGAAATATG
18    HERV-L         AGGGGTGGGACTTGCGATG
19    HERV-W         TGTTGAGATGGGGGAGTGAG
20    HERV-W         GCAGTTGCAAGATTTAATAGAG
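Several primers in Tab. 2 (nos. 35, 37 and 39) contain IUPAC ambiguity codes (M, Y, S, K, R), so each of them stands for a pool of related sequences. The following is a minimal sketch of how such a degenerate primer can be expanded into its individual sequences; the function names are illustrative assumptions and are not taken from the application.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_degenerate(primer: str) -> list[str]:
    """Return every unambiguous sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    primer_35 = "GTTCCMTYAGTATTTATTGATC"  # primer 35 (HERV-K) from Tab. 2
    print(degeneracy(primer_35))
    for sequence in expand_degenerate(primer_35):
        print(sequence)
```

Primers without ambiguity codes expand to themselves, so the same functions can be applied to every row of the table.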
Tab. 3: Analyzed HERV-LTRs
A: HERV-LTRs from different cell lines and tissues

LTR                     Primer   Origin         Homology

HERV-K2                          T47-D          HERV-K10, M12854, 97.9 %
HERV-K3                 34/36    T47-D          HERV-K10, M12854, 98.4 %
HERV-K22-K32-K27-K45             brain          HERV-K10, M12854, 98.6 % *
HERV-K30                         heart          HERV-K10, M12854, 97.6 %
HERV-K-T47D-L5
HERV-K-T47D-L50                  T47d
HERV-K-T47D-L8                   T47d           MSRV, AF127229
HERV-K-T47D-L9                   [illegible]
HERV-K-T47D-L48                  T47d
HERV-K-T47D-L20                                 MSRV, AF127229
HERV-IP-T47D                                    MSRV, AF127229
HERV-T47D-W2                                    MSRV, AF127229
HERV-T47D-W4                     [illegible]    MSRV, AF127229
HERV-T47D-W5                                    MSRV, AF127229
HERV-W1                          T47d
HERV-W10                         T47d
HERV-W11                                        MSRV, AF127229
HERV-W18                                        MSRV, AF127229
HERV-W2                          T47-D          MSRV, AF127229
HERV-W22                         T47-D          MSRV, AF127229
HERV-W23                19/20    T47-D          MSRV, AF127229
HERV-W4                 19/20    T47-D          MSRV, AF127229
HERV-W5                 19/20    T47-D          MSRV, AF127229
HERV-W6                 19/20    T47-D          MSRV, AF127229
HERV-W8                 19/20    T47-D          MSRV, AF127229
HERV-H1                 1/2      H9             Cercopithecus aethiops ERV-H, U96012, 87.1 %
HERV-H8                 1/2      HUT            HERV-H LTR18106, 84.8 %
HERV-H13                1/2      HUT            HERV-H LTR18106, 91.8 %
HERV-H19                1/2      liver          Callithrix jacchus ERV-H, 5'LTR, U96052, 92.1 %
HERV-H31                1/2      liver          HERV-H(H6) X12717, 99.8 %
HERV-H3                 1/31     85HG66         HERV-H(H6) X12717, 100 %
HERV-H CL1              1/2      Chang Liver    HERV-H(H6) X12717, 100 %
HERV-H CL2              1/2      Chang Liver    HERV-H LTR18106, 84 %
HERV-H CL3              1/2      Chang Liver    Callithrix jacchus ERV-H, 5'LTR Silva 5, U96057, 84.2 %
HERV-H CL4              1/2      Chang Liver    HERV-H(H6) X12717, 100 %
HERV-H PA7              1/2      Panc1          Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 85.7 %
HERV-H PA8              1/2      Panc1          Cercopithecus aethiops ERV-H, Vero 22, U96012, 87.1 %
HERV-H PA9              1/2      Panc1          HERV-H LTR18106, 85 %
HERV-H PA10                                     Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 85.6 %
HERV-H MC14             1/2      MCF7           Cercopithecus aethiops ERV-H, Vero 22, U96012, 86.6 %
HERV-H MC15             1/2      MCF7           Cercopithecus aethiops ERV-H, U96012, 86.6 %
HERV-H MC16             1/2      MCF7           Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 87.4 %
HERV-H MC17             1/2      MCF7           Cercopithecus aethiops ERV-H, Vero 22, U96012, 86.6 %
HERV-H MP20             1/2      MiaPaca        Human beta globin retrovirus-like repetitive element, K01891, 92.8 %
HERV-H MP21             1/2      MiaPaca        HERV-H LTR18106, 89.2 %
HERV-H MP23             1/2      MiaPaca        HERV-H(H6) X12717, 99.5 %
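The homology column of Tab. 3 A gives percentage values, but the application does not state here how they were calculated. As a hedged illustration only, the sketch below computes a simple percent identity over two already aligned sequences, skipping gap columns; the function name and the choice of gap characters are assumptions. The example compares primer 19 from Tab. 2 with the first 20 positions of the HERV-T47D-W2 row of the HERV-W alignment.

```python
def percent_identity(seq_a: str, seq_b: str, gap_chars: str = ".-:") -> float:
    """Percent identical positions over columns where both sequences carry a base."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in gap_chars or b in gap_chars:
            continue  # gap columns are not counted
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


if __name__ == "__main__":
    primer_19 = "TGTTGAGATGGGGGAGTGAG"      # primer 19 (HERV-W), Tab. 2
    w2_start = "TGTTGAGATGGGGGACTGAG"       # positions 1-20 of HERV-T47D-W2
    print(percent_identity(primer_19, w2_start))  # one mismatch in 20 positions: 95.0
```

Database search tools usually report identity over a local alignment, so values obtained this way need not agree with those in the table.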
B: HERV-LTRs published in the literature

LTR                     Reference

HERV-K-PI167    970     Leib-Mösch et al., 1993
HERV-K-T47-D    1200    Seifarth et al., 1998
HERV-H-H6       393     Feuchter and Mager, 1990
HERV-T-S71A     625     Murr, Dissertation, 1998
HERV-E          391     Steele et al., 1984
HERV-L          462     Cordonnier et al., 1995
Fig. 2g: Relative promoter activity of different HERV-LTRs in different cell lines (legend: HERV-K-T47D, HERV-K-PI165) 
Fig. 4: LTR-R region modulates promoter activity of HERV-K-T47D related LTRs 
Fig. 7: Retroviral ProCon vectors pLESN-MMTV and pLESN-HERV-H 
Fig. 8: Retroviral ProCon vectors pLX-MMTV and pLX-HERV-H 
Fig. 9: a) Promoter conversion of the hybrid ProCon vectors (schematic: HERV-H/MLV hybrid LTRs with U3, R and U5 regions before and after transcription, reverse transcription and promoter conversion; primer pairs P2/P3 and P1/P3) 

b) Demonstration of the correct promoter conversion with PCR and hybridization 
with a HERV-H and a psi probe (primer pairs P2-P3, P1-P3 and P5-P3; lanes 1: CK; 2: CK-pLX-MMTV; 3: CK-pLX-HERV-H) 




Fig. 10: a) Organization of the two ProCon vectors pLX-MMTV and pLX-HERV-H 

b) Promoter activity of the HERV-H LTR in comparison to the MMTV LTR following 
infection of CrfK cells 
HERV-H li31 

HERV-H HCM2 

HERV-H 19 

HERV-H MP 2 0 

HERV-H CMS 

HERV-H MCI 6 

HERV-H CMl 

HERV-H MP 2 3 

HERV-H HI 3 

HERV-H HI 

HERV-H HU8 

HERV-H PAT 



HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 



HERV- 
HERV- 
HERV- 



HERV- 
HERV- 
HERV- 
HERV 
HERV- 



HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV 
HERV- 



H r.31 

H HCM2 

H 19 ^ 

H MP20 

H CM3 

H MC16 

H CMl 

H MP23 

H H13 

H HI 

H HU8 

H PA7 



H L31 

H HCM2 

H 19 

H MF20 

H CM3 

H MC16 

H CMl 

H MP23 

H H13 

H HI 

■H HU8 

H PA7 



H L31 
-H HCM2 
■H 19 
•H MP20 
■H CM3 
■H MC16 
-H CMl 
-H MP23 
-H H13 
-H HI 
-H HUB 
-H PA7 



HERV-H LSI 

HERV-H HCM2 

HERV-H 19 

HERV-H MP 20 

HERV-H CM3 

HERV-H MCI 6 



TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT. 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 
TGTCAGGCCT 

51 

CGTATACATC 
CGTATACATC 
CATATACATC 
CGTACACATC 
CGTGTATGCC 
CGTATACATC 
CGTATACGCC 
CGTATACGCC 
CGTATACATC 
GGTGTACATC 
. .TATACATC 
CGTGTACATC 

101 

T 

CAAAAGAAGT 
CAAAAGAAGT 
CGAAAGAAGT 
CAAAAGAAGT 
CAAAAGAAGT 
CAAAAGAAGT 
CAAAAGAAGT 
CAAAGGAAGT 
CAAAAGAAGT 
CAAAGGAAGT 
CAAAAGAAGT 

151 

TTGTGATTTG 
TTGTGATTTG 



TGGTGATTTG 
TGGTGATTTG 
TTGTGATTTG 
TTGTGATTTG 
TTGTGATTTG 
TTGTGATTTG 

201 

CCCCCACCCT 
CCCCCACCCT 
TCCTTCTCCT 
TCCTCCTCCT 
TCCTTCCCCT 
TCCTTCTCCT 



CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 
CTGAGCCCAA 



CAGATAGCC . 
CAGATGGCC . 
CAGATGGCC . 
CAGATGGCCG 
CAGATGGCC . 
CAGATG .... 
CAGATGGCC . 
CAGATGGCC . 
CAGATGGCC . 
CAGATGACC . 
CAGATGGCC . 
CAGATGACC . 



GCTAAGCCAT 
GCCAGGCCAT 
GCTAAGCCAT 
GCTAAGCCAT 
GCCAAGCCAT 

GCC 

GCCAAGCCAT 
GCCAAGCCAT 
GCTAAGCCAT 
GCTAAGCCAT 
GCTAAGCCAT 
GCTAAGCCAT 



TGAAG 

TAAAG 

TGAAG 

GTTCCTGCCT 

TGAAG 

AAG 

TGAAG 

TGAAG 

TGAAG 

TGAAG 

TGAAG 

TGAAG 



CATATCCCCT 
CGCATCCCCT 
CATATCCCCT 
CATATCCCCT 
CGCATCCCCT 

CGCATCCCCT 
CGCATCCCCT 
CATATCCCCA 
CAAATCCCCT 
CATATCCC . . 
CAAATCCCCT 



50 

GTGACCTGCA 
GTGACTTGCA 
GCGACCTGCA 
GTGACCTGCA 
GTGACTTGCA 
. . . . TGCA 
GTGACTTGCA 
GTGACTTGCA 
GGGACCTGCA 
GTGACCTGCA 
GTGACCTGCA 
GTGACCTACA 



AAAAATATCC 
AAAAACAGCC 
GAAAATGGCC 
GAAAATGACC 
GAAAAGGCCC 
GAAAATGGCC 
GAAAAGGCCC 
GAAAAGGCCC 
GAAAATAGCC 
GAAAGTAGCC 
GAAAATAGCC 
GAAAGTAGCC 



100 

CAACTG 

TAACTGAAGA TCCA 

TAACTGAAGA ATCA 

TAACTGATGA CATTCCACCA 

TAACTGAAGA ATCA 

CAAGTGAAGA ATCA 

TAACTGAAGA ATCA 

TAACTGAAGA ATCA 

CAACTGAAGA TCCA 

CAAGTGAAGA TCCA 

CAACTGAAGA TCCA 

CAACTGAAGA TCCA 



150 

TTAACTGATG ACA TTCCAATA 

TTAACTGATG ACA TTCCAACA 

TGTTCC 

TGTTCG 

TGCCCC 

GGTTCC 

TGCCCCGCCT TAACTGATGA CATTCCACCA 
TGCCCCGCCT TAACTGATGA CATTCCACCA 

TTAACTGATG ACA TTCCACCA 

TTAACTGATG ACA TTCCACCA 

TTAACTGATG ACA TTCCACCA 

TTAACTGATG ACA TTCCACCA 



TTTCTGGCCT 
TTCCTGCCCC 



TTCTTGCCCC 
TTCTTGCCCC 
TTTCTGCCCC 
TTCCTGCCCC 
TTCCTGCCCC 
TTCCTGCCCT 



TCAGAAGGCT 
TAAGAAGGTC 
GGCTGATCCT 
GGCTCATCCT 
GGCTCAG. . . 
GGCTCAG. . . 



ACCCTGAGTG 
ACCCTAACTG 
GCCTTAACTG 
GCCTTAACTG 
AGGTTAACTG 
GCCTTAACTG 
ACCTTAACTG 
AGGTTAACTG 
ATCCTAACTG 
ACGCTAACTG 
ACGCTAACTG 
ACGCTAGCTG 



GTTTGTAATC 
CTTTGTAATT 
GGCTCAAAAG 
GGCTCAAAAG 

AAG 

AAG 



ATCAATGTGG 
ATAAATGTAC 
ATGACATTAC 
ATGACATTGT 
AGTGATTAAC 
ATGACATTAC 
AGTGATTAAC 
AGTGATTAAC 
ATCAATGTAC 



AT . 



200 

TTTGTAATCT 
TTTGTAATCT 
CTTGTGAAAT 
CTTGTGAAAT 
CCCATGAATT 
CTTGTGAAAT 
CCTGTGAATT 
GGTGTGAATT 
TTTGTAATCT 
CATATATTCT 
CATATATTCT 
CATATATTCT 



250 

CTCCGCACGC TTGAGAATGG 

CTCCCCACCC TTGAGAGTGT 

CTC . . CCGCA CTGAG . . . . C 

CTC . . CCCGA CTGAG . . . . T 

CTC . . CCGCA CTGAG . . . . C 

CTC . . CCGCA CTGAG . . . . C 



HERV-H CMl 

HERV-H MP23 

HERV-H H13 

HERV-H HI 

HERV-H HU8 

HERV-H PA7 



■H Ii31 
•H HCM2 
■H 19 
■H MP20 
■H CMS 
■H MC16 

H CMl 
■H MP23 
•H H13 
•H HI 

H HU8 
•H PA7 



HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 



HERV-H L31 
HERV-H HCM2 



HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 
HERV-H 



19 

MP20 

CM3 

MC16 

CMl 

MP23 

H13 

HI 



HERV- 
HERV- 
HERV- 
HERV- 



HERV- 
HERV- 
HERV- 



HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 
HERV- 



H L31 

H HCM2 

H 19 

H MP20 

H CM3 

H MC16 

H CMl 

H MP23 

H H13 

H HI 

H HU8 

H PA7 



H Ii31 

H HCM2 

H 19 

H MP20 

H CM3 

H MC16 

H CMl 

H MP2 3 

H H13 

H HI 

H HUB 

H PA7 
TGCTTCTCCT GGCTCAG AAG CTC. . CCCCA CTGAG . . . .C 

TGCTTCTCCT GGCTCAG AAG CTC. .CCCCA CTGAG. . . .C 

CTCCCACCCT TAAGAAGGTT CTTTGTAATT CTCCCCACCC TTGAGAGTGT 

TCCCC CGCCC TTGAGAATGT 

TCCCC CGCCC TTGAGAATGT 

TCCCC CGCCC TTGAGAATGT 

251 300 
ACTTGGTGAG ATCCACCCCC TGCCTGCAAA GCATTGCCCC TAACTCCACC 
ACTTTGTGAG ATCCACACCT GCCCACCAGA GAACAAACCC CCTTTGACTG 
ACCTTGTGAC CCCTGCCTCT GCCCGCCAGA GAGCAACCCC CTCTTGACTG 
ACATTGTGAC CCCCACTCCT GCCCGCCAGA GAACAGCCCC CT.TTGACTG 
ACCTTGTGAC" CCCTGCCCCT GCCCACCAGA GAACAACCCC CT.TTGACTG 
ACCTTGTGAC CCCCACTCCT CCCCGCCACA GAACAACCCC CT.TTGACTG 
ACCTTGTGAC CCCCGCCCCT GCCCACCAGA GAACAACCCC CT.TTGACTG 
ACCTTGTGAC CCCCGCCCCT GCCCACCAGA GAACAGACCC CT.TTGACTG 
ACTTTGTGAG ATCCACCCCC TGCCGGCAAA ACATTGCTCC TAACCCAACC 

ACTTTGTA C 

ACTTTGTA C 

ACTTTGTA C 

301 350 

GCCTGTCCCA AAACCTATGA GAA . CTAATG ATA ATCCC.ACCA 

TAATTTTCCA TTACCTTCCC TAATCCTATA AAACGGCCCC ACCCC.ATCT 

TAATTTTCCT TTACCTACCT AAATCCTATA AAATGGCCCC ACTCCTATCT 

TAATTTTCCT TTATCTACCC AAATCCTATA AAACAGCCCC ACCTTTATCT 

TAATTTTCCA TTACTTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT 

TAATTTTCCA CTGCCCGCCC AAACCCTATA AAACGGTCCC ACCCC.ATCT 

TAATTTTCCA TTACCTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT 

TAATTTTCCA TTACCTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT 

GCCTA . CCCC AAACCTGTAA GAA . CTAATG ATA ATCC . . ACCA 

ACCTATCCC. AAACCTATAA GAA. CTAATG ATA ATCCT . ACCA 

ACCTATCCC. AAACCTATAA GAA . CTAATG ATA ATCC .. ACCA 

ACCTATCCC. AAACCTATAA GAA. CTAATG ATA ATCCT . ACCA 



351 

CACTTTGCTG 
CCCTTTGCTG 
CCCTTCGCTG 
CCCTTGGCTG 
CCCTTCGCTG 
CCCTTCGCTG 
CCCTTCGCTG 
CCCTTCGCTG 
CCCTTTGCTG 
CCCTTTGCTG 
CCCTTTGCTG 
CCCTTTGCTG 

401 

AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 
AAATAAACAG 



ACTCTCTTTC 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTC . . TTTT 
ACTCTCTTTT 
ACTCTCTTTT 
ACTCTCTTTT 



CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 
CCATGTTGCT 



C . . .AGACTC 
C . . . GGACTC 
C . . . GGACTC 
C . . . GGACTC 
C . . . GGACTC 
CTTCGGACTC 
C . . . GGACTC 
C . . . GGACTC 
C . . . AGAATC 
T. . .GGACTC 
T. . .GGACTC 
T. . .GGACTC 

425 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 
CACAT 



AGCCCGGCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 
AGCCCGCCTG 



400 

CACCCAGGTG 
CACCCAGGTG 
TACCCAGGTG 
CACCCAGGTG 
CACCCAGGTG 
CACCCAGGTG 
CCCCCAGGTG 
CCCCCAGGTG 
CACCCAGGTG 
CACCCAGGTG 
CACCCAGGTG 
CACCCAGGTG 



B. HERV-W LTR sequences 

              1                                                   50
HERV-T47D-W2  TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCGA
HERV-T47D-W4  TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCGA
HERV-T47D-W5  TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCGA
HERV-W1       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W10      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTC CCTAGGCCGA
HERV-W11      TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCTGG
HERV-W18      TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCCGG
HERV-W2       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W22      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCTGA
HERV-W23      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCTGA
HERV-W4       TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCTGG
HERV-W5       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W6       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W8       TGTTGAGATG GGGGACTGAG AAACAGGACT AGCAGGATTT CCTAGGCCGA

              51                                                 100
HERV-T47D-W2  CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ACCTTTAAAC
HERV-T47D-W4  CTAAGAATTC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ATCTTTAAAC
HERV-T47D-W5  CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W1       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAAAC
HERV-W10      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W11      CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W18      CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W2       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W22      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ATCTTTAAAC
HERV-W23      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W4       CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W5       CTAAGAATCT CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W6       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W8       TTAAGAATCC CTAAGCCTAG ATGGGAAGTT GACCACATCC ACCTTTAAAC

              101                                                150
HERV-T47D-W2  ACGGGGCTCG CAACTTAGCT CACACCCAAC CAATCAGGTA GTAAAGAGGG
HERV-T47D-W4  ATGGGGCTTG CAACTTAACT CATATCTGAC CAATCAGGTA GTAAAGAGAG
HERV-T47D-W5  ACAGGGCTTG CAACTTAGCT CACACTTGAC CAGTCAGGTA GTAAAGAGAG
HERV-W1       ATGGGGCTTG CAACTTAGCT CACACCCAAC CAATCAGGTA GTAAAGAGAG
HERV-W10      ACGGGGCTTG CAACTTAGCT CATACCCAAC AAATCAGGTA GTAAAGAGAG
HERV-W11      ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W18      ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W2       ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W22      ATGGGGCTTG CAACTTAACT CATATCTGAC CAATCAGGTA GTAAAGAGAG
HERV-W23      ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W4       ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W5       ACAGGGCTTG CAACTTAGCT CACACCCGAC CCATCAGGTA AGAAAGAGAG
HERV-W6       ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W8       ACGGGGCTTG CAACTCAGCT CACACCCGAC CCATCAGGTA AGAAAGAGAG

              151                                                200
HERV-T47D-W2  CTCACTAAAA TGCTAATTAG GCAAAGACAG GAGGTAAAGA AATAGCCAAT
HERV-T47D-W4  CTCACTAAAA TGCTAATTAG GCTAAAACAG GAGGCAAAGA AGTAGCCAAT
HERV-T47D-W5  CTCACTAAAA TGCTAATTAG GCTAAAACAG GAGGTAAAGA AATAGACAAT
HERV-W1       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W10      CTCACTAAAA TACTGATTAG GCGAAAACAG GAGGTAAGGA AACAGCCAGT
HERV-W11      CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W18      CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W2       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAGAGA AATAGCCAAT
HERV-W22      CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W23      CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W4       CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W5       CCCGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGTCAAT
HERV-W6       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W8       CCCGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAAT

              201                                                250
HERV-T47D-W2  CATCTATTGC CTGAGAGCAC AGCAGGAGGG ACAATGATCG GGATATAAAC
HERV-T47D-W4  CATCTGTTGC CTGACAGCAC AGCAGGAGGG ACAATGATCG GGATATAAAC
HERV-T47D-W5  CATCTATCAC CTGAGAGCAC AGTGGGAGGG ACAATGATCG GGATATAAAC
HERV-W1       CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W10      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W11      CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W18      CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W2       CATCTATCGC CTGAGAGCAC AGCAGGAGGG ACAATGATCG GGATATAAAC
HERV-W22      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W23      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W4       CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W5       CATCTATTGC CTGAGAGCAC AGCGGGAGGG ACAATGATCA GGATATAAAC
HERV-W6       CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W8       CATCTATTGC CTGAGAGCAC AGCGGGAGGG ACAATGATCA GGATATAAAC

              251                                                300
HERV-T47D-W2  CCAAGTCTTC GAGCCGGCAA TGGCTACCTT CTTTGGGTCC CCTCCCTTTG
HERV-T47D-W4  CCAGGCATTC GAGCCAGCTA CAGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-T47D-W5  CCAGGCATTC GAGCCAGCAA CAGCAACCCC CTTTGGG... ..........
HERV-W1       TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W10      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W11      CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W18      CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W2       CCAAGCATTC GAGCCAGCAA TGGCTACCCT CTTTGTGTCC CCTCCCTTTG
HERV-W22      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W23      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W4       CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W5       CCAGGCATTC GAGCCGGCAA CGACTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W6       TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W8       CCAGGCATTC GAGCCGGCAA CGACTACCCT CTTTGGGTCC CCTCCCTTTG

              301                                         343
HERV-T47D-W2  TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-T47D-W4  TATGGGAGCT CTGTCTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-T47D-W5  ......AGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-W1       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W10      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W11      TATGGGAGCC CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W18      TATGGAAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W2       TATGGGAGCT CTATTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W22      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W23      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W4       TATGGAAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W5       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W6       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W8       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
C. HERV-K LTR sequences 

          1                                                   50
HERV-K45  GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT :GCAGAATTC
HERV-K27  GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT :GCAGAATTC
HERV-K2   GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT :GCAGAATTC
HERV-K1   GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT :GCAGAATTC
HERV-K30         GTC CCACCTCCAG CCCTAAGGCG GTTTTTCCCT ATCTCAGTAG
HERV-K10                                                   AGTAG

          51                                                 100
HERV-K45  ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K27  ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K2   ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K1   ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K30  ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K10  ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC

          101                                                150
HERV-K45  AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K27  AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K2   AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K1   AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K30  AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K10  AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT

          151                                                200
HERV-K45  TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K27  TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K2   TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K1   TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K30  TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K10  TCCTCTTTTA CTAATCCTCC TCAGCACAGA CCCTTTACAG GTGTCGGGCT

          201                                                250
HERV-K45  GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K27  GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K2   GGGGGATGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K1   GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K30  GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K10  GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC

          251                                                300
HERV-K45  ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K27  ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K2   ACATGGGAAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K1   ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K30  ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K10  ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC

          301                                                350
HERV-K45  TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K27  TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K2   TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K1   TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K30  TGCGGCCTTC CGCAGTTTTT GTGTCC:TGG GTACTTGAGA TTAGGGAGTG
HERV-K10  TGCGGCCTTC TGCAGTTTTT GTGTCCCTGG GTACTTGAGA TTAGGGAGTG

          351                                                400
HERV-K45  GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K27  GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K2   GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K1   GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K30  GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAGGC
HERV-K10  GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC

          401                                                450
HERV-K45  ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K27  ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K2   ACATCCTGCA CTGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCGCAC
HERV-K1   ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K30  ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K10  ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAT

          451                                                500
HERV-K45  GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K27  GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K2   GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K1   GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K30  GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K10  GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC

          501                                                550
HERV-K45  AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K27  AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K2   AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K1   AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K30  AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K10  AAGGCAGAAG AATTTTTCTT AGCACATAAC AAAATGGAGT CTCCTATGTC

          551                                                600
HERV-K45  TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K27  TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K2   TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K1   TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K30  TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCCCTC TTGCTTTTCC
HERV-K10  TACTTCTTTC TACACAGACA CAGTAACAAT TTGATCTCTC TTGCTTTTCC

          601                                                650
HERV-K45  CCACATTTCC CCCTTTTCTT TTCG
HERV-K27  CCACATTTCC CCCTTTTCTT TTCGA
HERV-K2   CCACATTTCC CCCTTTTCTT TTCGACAAA
HERV-K1   CCACATTTCC CCCTTTTCTT TTCGACAAAA CCGCCAT:CT CGAGATC:TG
HERV-K30  CCACATTTCC CCCTTTTCTT ATCCATCACA CTGGCGGCCG CTCGAGCATG
HERV-K10  CCACATTTCC CCCTTTTCTT TTCGACAAAA CCGCCATC

          651
HERV-K1   AGT
HERV-K30  CATCTAGAGG GCCCAATTCG CCCTATAGTG
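The HERV-K alignment can be related to the primers of Tab. 2: the alignment begins (position 51) with the sequence of primer 36 and ends, in the HERV-K10 row, with the reverse complement of primer 34. The sketch below shows one way to check such primer sites; the helper names are illustrative assumptions, and the two template fragments are the HERV-K10 row at positions 51-70 and 601-638.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_site(template: str, primer: str, antisense: bool = False) -> int:
    """Index of the primer binding site on the given strand, or -1 if absent.

    With antisense=True the primer is searched as its reverse complement,
    which is how a reverse primer appears on the strand shown in an alignment.
    """
    query = reverse_complement(primer) if antisense else primer
    return template.upper().find(query.upper())


if __name__ == "__main__":
    primer_36 = "ATGGAGCATACAATCGGG"   # HERV-K, Tab. 2
    primer_34 = "ATGGCGGTTTTGTCGAA"    # HERV-K, Tab. 2
    k10_start = "ATGGAGCATACAATCGGGTT"                    # positions 51-70
    k10_end = "CCACATTTCCCCCTTTTCTTTTCGACAAAACCGCCATC"    # positions 601-638
    print(find_site(k10_start, primer_36))                # forward primer site
    print(find_site(k10_end, primer_34, antisense=True))  # reverse primer site
```

Exact string matching is used here for simplicity; degenerate primers would first have to be expanded or matched with ambiguity-aware comparison.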
HERV-K-T47D-5'LTR 

TGTGGGCGAAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATATAGAAAATAGCTAG 
AATAAGAATAGTTATAATAAAAATTAGATATACACATGATCATGGACATTACCAATCATTACTACAAACATTGTTAATCA 
TTAGCTTTTAATATTACTCTTTGTTTTATTACTAATATAACCAAGGAATAACCGGTAGCATACGGTCAGGTGCTGAAGGG 
ACATTGTGAGAAGTGACCTAGAAGGCAAGAGGTGAGCCTTCTGTCACGCCTGCATAAGGACAGCTTGAGGGCTCCTTGGT 
CAAGCTGTAACACCAGTGCCTGGGAAGGCACCGTTACTTAGCAGACCATGAAAGGGAGTCTCCATTCCTTGGAGGAGTCA 
GGGAAACACTATGCTCCACCAGCTTCTTGTGTATCCAGCCCTGCCCACAGTCATCCAGAGGCATAAACCCCTCCCTGTGG 
TGCTGTGCTTCAATGGCCATGCTTCTTGTCCACTTTCATGTTCCTCCTGTACTCCTGGTTCCTCTTTGAAGTTCGTAGAA 
GATAATGGTAGAAGAAATAGTGAAAGTCTTTGATCTTTCTTATAAGTGCATAGAAGAAAACACTGATGTATGCCTGCCTT 
CCCTCTCTGCTTCAGCTACCTAAAAGGAAAGGCCCCCTTTCCCATGATCACATGACTTGCCTGACCTTATCAATCACTTG 
GAGGACTCACCCTCCTTACCCTGTCCCTTTGTCTTGTATGCAATAAATATCAGCACGCCCAGCCATTCGGGGCCACTACT 
GGTCTCCGCAACTTGGTGGTAGTGGTACCCTGGGCCCAGCTGTTTTCTCTTTATCTCTTTTGTCTTGTGTCTTTATTTCT 
TACAATCTCTCATCTCTGCACATGGGGAGAACACCGGCAAAGCCCGTAGGGCTGGACCTTACA 



L48-LTR (U3-R) 

TGTGGGCGGAAGAGTACCTAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATACAGCGATGATCATGAACAATTATCCATCATTATTATAAACATTATTAATCATTA 
GCTTTTAATATTACTCTGTTGCATTAATAATATAACCTAGGAATAACCGGCAGGTATAGGGTCAGGTGCTGAAGGGACAT 
TGTGAGAAGTGAATAGAAGGCAAGAGGGGAGCCTTCTGTCATGCCCGCATAAGGGCCGCTTGAGGGCCCCTTGGTCAAGC 
GGTAACGCCAGTGTCTGGGAAGGCACCCGTTACTGAGCAGACCGGGAAAGGGAGTCTCCTTTCCTTGGAGGAGTCAGGGA 
ACGCTCTGCTCCACCAGCTTCTTGTGGGAGGCTGGATGTTACCCAGGCCTGCCTGCAGTCATCCGGAGGCCTGAACCCCT 
CCCTGTGGTGCTTCAATGGTCACGTTCCTTGTCCACTTTCATGCTCCTTCCGTACTCCTGGTTCCTCTTTGAAGTTCGTA 
GTAGATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTTGATCTTATAAGTTCATAGAAGAAAACGCTGATGCCTGC 
CGCCTTCTCTCTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCGCTGTCCTGTGATCAGGTGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGACTGACCCTCCTTATCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGCGAGCCCAGCCGTTCAGGGC 
CACTACCGGTCTCCGTGTCTTTGTGGTAGTGGTCCCCGGGCCCAGCTGTTTTCTCTTT 



L5-LTR (U3-R) 

TGTGGGTGGAGGATTACCCAGGTGCCAAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAAAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTAGCAATCACTATTAATCTTTAGCTTTTAATATT 
ACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAGGGACATTGTGTGAAGT 
GACCTGGAAGGCAAGAGGTGAGCCCTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCCTTGGTCAAGTGGTAACGCC 
AGTGTCTGGGAATGCACCCGTTAATTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAAGAGTTGGGGAACACTCTGC 
TCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCGCGCAGTCATCCGGAGGCTTAAACCCTTCCCTGTGGT 
GCTGTGCTTCAATGGTCCCACTCCTTGTCCACTTTCATGCTCCTCCCGTACTCCTGGTTCCTCTTTGAAGAGCGCAGTAG 
ATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTCGATCTTTCTTACAAGTGCAGAGAAGAAAACGCTGACATATGC 
TGCCTTCCCTCTCTGCTTCGGCTACCTAAAAGGGAAGGGCCGCCTATCCTGTAATCACATGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGATTCACTCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGACCCCAGCCGTTCAGGGC 
CACTACTGGTCTCCGCGTCTTGATGGTAGTGGTCACCCCGGCCC 



L50-LTR (U3-R) 

TGTGGGTGGAGGATTACCCAGGTGCCAAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAAAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTAGCAATCACTATTAATCTTTAGCTTTTAATATT 
ACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAGGGACATTGTGAGAAGT 
GACCTGGAAGGCAAGAGGTGAGCCCTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCCTTGGTCAAGTGGTAACGCC 
AGTGTCTGGGAATGCACCCGTTAATTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAAGAGTTGGGGAACACTCTGC 
TCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCGCGCAGTCATCCGGAGGCTTAAACCCTTCCCTGTGGT 
GCTGTGCTTCAATGGTCCCACTCCTTGTCCACTTTCATGCTCCTCCCGTACTCCTGGTTCCTCTTTGAAGAGCGCAGTAG 
ATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTCGATCTTTCTTACAAGTGCAGAGAAGAAAACGCTGACATATGC 
TGCCTTCCCTCTCTGCTTCGGCTACCTAAAAGGGAAGGGCCGCCTATCCTGTAATCACATGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGACCCCAGCCGTTCAGGGC 
CACTACTGGTCTCCGCGTCTTGATGGTAGTGGTCACCCCGGCCCAGGTGTTTTTTCTTT 



L9-LTR (966 nt) 
TGTGGGTGGAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATGGTTAG 
AATAAGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTATTAATC 
ATTAGCTTTTAATATTACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAA 
GGACATTGGGAGAAGTGACCTAGAAGGCAAGAGGTGAGTCTTCTGTCACGCCCGCATAAGGGTTGCTTGAGGGCTCCTTG 
GTCAAGTGGTAACGCCGGTGTCTGGGAAGGCACCTGTTACTTAGCCGACCACGAAAGGGAGTCTCCTTTCCTTGGAGGAG 
TCAGGGCGCACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCCCGCAGTCATCCGGAGGCCTA 
AACCCCTCCCTGTGGTGCTGTGCTTCAATGGGCACACTCCTCGTCCACTTTCATGTTCCTCCCATACTCCTGGTTTCTCT 
TTGAAGTTCGTAGTAGATAGTGGTAGAAGGAATAGGGAAAATCTTAAAGTGTTTGATCTTTCTTATAAGTGCATAGAAGA 
AAACGCTGACATATGCTGCCTTCTCTGTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCCCTGTCCAGTGATCACGTGACT 
TGCTTCACCTTGTCAATCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGCA 
CCCAGCCTTTCGGGGCCACTTACCGGTCTCCACGTCTTGGTGGTAGTGGTCCCCCGGGCCCAGCTGTTTTCTCTTTATCT 
CTTTGTCTTGTGTCTTATTTATTACAATCTCTCGTCTCCGCACACAGGGAGAACACCCGCTAAGCTCCGTAGGGCTGGAC 
CCTACA 



L8-LTR (938 nt) 

TGTGGGTGGAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATGGTTAG 

AATAAGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTATTAATC 

ATTAGCTTTTAATATTACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAG 

GGACATTGGGAGAAGTGACCTAGAAGGCAAGAGGTGAGTCTTCTGTCACGCCCGCATAAGGGTTGCTTGAGGGCTCCTTG 

GTCAAGTGGTAACGCCGGTGTCTGGGAAGGCACCTGTTACTTAGCCGACCACGAAAGGGAGTCTCCTTTCCTTGGAGGAG 

TCAGGGCACACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCCCGCAGTCATCCGGAGGCCTA 

AACCCCTCCCTGTGGTGCTGTGCTTCAATGGGCACACTCCTCGTCCACTTTCATGTTCCTCCCATACTCCTGGTTCCTCT 

TTGAAGTTCGTAGTAGATAGTGGTAGAAGGAATAGGGAAAATCTTAAAGTGTTTGATCTTTCTTATAAGTGCATAGAAGA 

AAACGCTGACATATGCTGCCTTCTCTGTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCCCTGTCCAGTGATCACGTGACT 

TGCTTCACCTTGTCAATCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGCA 

GCCAGCCTTTCGGKKCACTTACCGGTCTCCACGTCTTGGTGGTAGTGGTCCCCCGGCCCAGCTGTTTTCTCTTTATCTCT 

TTGTCTTGTGTCTTATTTATTACAATCTCTCGTCTCCGCACACAGGGAGAACACCCGC (truncated 26 nt before the end of the LTR) 



L49-LTR = Ij20-LTR (963 nt) 

TGTGGGCGAAAGATTACCTAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATAGTTAA 
AATAAGAATAGTTATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTAATCATT 
AGCTTTTAATATTACTCTTTGTTGCTTTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGTTGACGGGA 
TATTGTGAGAAGTGACCTAGAAGGCAAGAGGTGAGCCTTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCTTTGGTC 
AAGTGGTAACGCCAGTGTCTGTGAAGGCACCTGTTACTTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAGGAGTCA 
GGGAACACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCTAGGCCTGCCCGCAGTCATCTGGAGGCCTAAAC 
CCCTCCCTGTGGTGCTGTGCTTCAGTGGTCACTCTCCTTGTCCACTTTCATGTTCCTCCCGTACTCCTGGTTCCTCTTTG 
AAGTTCGTAGTAGATAGCAGTAGAAGAAATAGTGAAAGTCTTAAAGTATTTGATCTTTCTTATAAGTGCATAGAAGAAAA 
CGCTGACATATGCTGCCTTCTCTATCTCTGCGGTGGCTACCTAAAAGGGAAGGGCCCCCTGTCCCATGATCATGTGACTT 
GCTTCACCTTATCACTTAGAAGATTCATCCTCCTTACCCTGCGCCCCCTCGTCTTGTATGCAATAAATATCAGCACGCCC 
AGTCGTTTGAGGCCACTGCCGGTCTCCGCGTCTTGGTGGTAGTGGTCCCCCGGGCCCAGCTATTGTCTCTTTATCTCTTT 
GTCTTGTGTCTTTATTTATTACAATCTCTTGTCTCTGCACACAGGGAGAACACCTGCTAAGCCCCGTAGGACTGGACCCT 
ACA 



HERV-IP-T47D 

TGTTCAATTCTTTGCCTTCTACTTTTAAACTTAACTTCCTCATAAAGCAACCTTTTTCAATCACCTGCTCCACTCTGACT 
CATTCTGATCACCTGCTCCACCCTGACTCATTCCGATCACCTGATCCACTGTGACTCATTCCGATTACCCGCTCCACCCT 
GACTCATTCTGATTCTGATTTCCTGCTCTGCCATAACCATTTTTCCCGCCAAACCACTCACCCTGTCACTCTCTTTAAAT 
TAGCCAATTGGAATTAGTTTAGCCTGTGCGGTCTAACCCTAGCCAATAGGGGACTGACACAGCAGCAGGGGCCACATGTG 
TCAGGAATAAGACCCCCTTCCCCTCCCTGTCCAGATGTGTGCTCACCATTGCTCCATCTGTGAGGGCACACCCTTCTATA 
GAAGTAAATTGCCTTGCTGAGAAGAAAAAAAAGAACATTTTATATTCAAGTCCTATTTCTTTTGCTGCACCGAAACTTTA 
TTTATAACA 

518 Rec'd PCT/PTO 31 AUG 2001

SEQUENCE LISTING 

<110> GSF - Forschungszentrum für Umwelt und Gesundheit GmbH

<120> Retroviral expression vectors on the basis of
HERV-LTR-sequences

<130> P12088 

<140> PCT/EP00/02064
<141> 2000-03-09 

<150> DE 199 10 650.9 
<151> 1999-03-10 

<160> 47 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 375 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 1 

tgtcaggcct ctgagcccaa gctaagccat catatcccct gtgacctgca cgtatacatc 60 
cagatagcct gaagcaactg taaaaatatc cttaactgat gacattccaa tattgtgatt 120 
tgtttctgcc ctaccctgac tgatcaatgt gctttgtaat ctcccccacc cttcagaagg 180 
ctctttgtaa tcctccccac ccttgagaat ggacttggtg agatccaccc cctgcctgca 240 
aagcattgcc cctaactcca ccgcctgtcc caaaacctat gagaactaat gataatccca 300 
ccacactttg ctgactctct ttccagactc agcccggctg cacccaggtg aaataaacag 360 
ccatgttgct cacat 375 



<210> 2 
<211> 400 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 2 

tgtcaggcct ctgagcccaa gccaggccat cgcatcccct gtgacttgca cgtatacatc 60 
cagatggcct aaagtaactg aagatccaca aaagaagtaa aaacagcctt aactgatgac 120
attccaacat tgtgatttgt tcctgcccca ccctaactga taaatgtact ttgtaatctc 180 
ccccaccctt aagaaggtcc tttgtaattc tccccaccct tgagagtgta ctttgtgaga 240 
tccacacctg cccaccagag aacaaacccc ctttgactgt aattttccat taccttccct 300 
aatcctataa aacggcccca ccccatctcc ctttgctgac tctcttttcg gactcagccc 360 
gcctgcaccc aggtgaaata aacagccatg ttgctcacat 400 



<210> 3 
<211> 361 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 3 

tgtcaggcct ctgagcccaa gctaagccat catatcccct gcgacctgca catatacatc 60
cagatggcct gaagtaactg aagaatcaca aaagaagtga aaatggcctg ttcctgcctt 120
aactgatgac attaccttgt gaaattcctt ctcctggctc atcctggctc aaaagctccc 180
gcactgagca ccttgtgacc cctgcctctg cccgccagag agcaaccccc tcttgactgt 240
aattttcctt tacctaccta aatcctataa aatggcccca ctcctatctc ccttcgctga 300
ctctcttttc ggactcagcc cgcctgtacc caggtgaaat aaacagccat gttgctcaca 360
361

<210> 4
<211> 372 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 4 

tgtcaggcct ctgagcccaa gctaagccat catatcccct gtgacctgca cgtacacatc 60
cagatggccg gttcctgcct taactgatga cattccacca cgaaagaagt gaaaatgacc 120
tgttcgtgcc ttaactgatg acattgtctt gtgaaattcc tcctcctggc tcatcctggc 180
tcaaaagctc cccgactgag tacattgtga cccccactcc tgcccgccag agaacagccc 240
cctttgactg taattttcct ttatctaccc aaatcctata aaacagcccc acctttatct 300
cccttggctg actctctttt cggactcagc ccgcctgcac ccaggtgaaa taaacagcca 360
tgttgctcac at 372



<210> 5 
<211> 349 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 5 

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtgtatgcc 60 

cagatggcct gaagtaactg aagaatcaca aaagaagtga aaaggccctg ccccacctta 120 

actgagtgat taaccccatg aatttccttc ccctggctca gaagctcccc cactgagcac 180 

cttgtgaccc ctgcccctgc ccaccagaga acaaccccct ttgactgtaa ttttccatta 240 

ctttcccaaa tcctataaaa cggccccacc cctatctccc ttcgctgact ctcttttcgg 300 

actcagcccg cctgcaccca ggtgaaataa acagccatgt tgctcacat 349 



<210> 6 
<211> 324 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 6 

tgtcaggcct ctgagcccaa gcctgcacgt atacatccag atgaagcaag tgaagaatca 60 
caaaagaagt gaaaatggcc ggttcctgcc ttaactgatg acattacctt gtgaaattcc 120 
ttctcctggc tcagaagctc ccccactgag caccttgtga cccccactcc tccccgccac 180 
agaacaaccc cctttgactg taattttcca ctgcccgccc aaaccctata aaacggtccc 240 
accccatctc ccttccctga ctctcttttc ttcggactca gcccgcctgc acccaggtga 300 
aataaacagc catgttgctc acat 324 



<210> 7 
<211> 393 
<212> DNA 

<213> Human endogenous retrovirus



<400> 7

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtatacgcc 60 

cagatggcct gaagtaactg aagaatcaca aaagaactga aaaggccctg ccccgcctta 120 

actgatgaca ttccaccatg gtgatttgtt cttgccccac cttaactgag tgattaaccc 180 

tgtgaatttg cttctcctgg ctcagaagct cccccactga gcaccttgtg acccccgccc 240 

ctgcccacca gagaacaacc ccctttgact gtaattttcc attaccttcc caaatcctat 300 

aaaacggccc cacccctatc tcccttcgct gactctcttt tcggactcag cccgcctgcc 360 

cccaggtgaa ataaacagcc atgttgctca cat 393 



<210> 8 
<211> 393 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 8

tgtcaggcct ctgagcccaa gccaagccat cgcatcccct gtgacttgca cgtatacgcc 60 
cagatggcct gaagtaactg aagaatcaca aaagaactga aaaggccctg ccccgcctta 120
actgatgaca ttccaccatg gtgatttgtt cttgccccac cttaactgag tgattaaccc 180 
tgtgaatttg cttctcctgg ctcagaagct cccccactga gcaccttgtg acccccgccc 240 
ctgcccacca gagaacagac ccctttgact gtaattttcc attaccttcc caaatcctat 300 
aaaacggccc cacccctatc tcccttcgct gactctcttt tcggactcag cccgcctgcc 360 
cccaggtgaa ataaacagcc atgttgctca cat 393 

<210> 9 
<211> 388 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 9 

tgtcaggcct ctgagcccaa gctaagccat catatcccca gggacctgca cgtatacatc 60 
cagatggcct gaagcaactg aagatccaca aaggaagtga aaatagcctt aactgatgac 120 
attccaccat tgtgatttgt ttctgcccca tcctaactga tcaatgtact ttgtaatctc 180 
tcccaccctt aagaaggttc tttgtaattc tccccaccct tgagagtgta ctttgtgaga 240 
tccaccccct gccggcaaaa cattgctcct aacccaaccg cctaccccaa acctgtaaga 300 
actaatgata atccaccacc ctttgctgac tcttttcaga atcagcccgc ctgcacccag 360 
gtgaaataaa cagccatgtt gctcacat 388 

<210> 10 
<211> 314 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 10 

tgtcaggcct ctgagcccaa gctaagccat caaatcccct gtgacctgca cgtgtacatc 60 
cagatgacct gaagcaactg aagatccaca aaagaagtga aagtagcctt aactgatgac 120
attccaccat tgtgatttgt tcctgcccca cgctaactga taccatatat tcttcccccg 180 
cccttgagaa tgtactttgt acacctatcc caaacctata agaactaatg ataatcctac 240 
caccctttgc tgactctctt tttggactca gcccgcctgc acccaggtga aataaacagc 300 
catgttgctc acat 314 

<210> 11 
<211> 309 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 11 

tgtcaggcct ctgagcccaa gctaagccat catatcccgt gacctgcata tacatccaga 60 
tggcctgaag caactgaaga tccacaaagg aagtgaaaat agccttaact gatgacattc 120 
caccattgtg atttgttcct gccccacgct aactgatacc atatattctt cccccgccct 180 
tgagaatgta ctttgtacac ctatcccaaa cctataagaa ctaatgataa tccaccaccc 240
tttgctgact ctctttttgg actcagcccg cctgcaccca ggtgaaataa acagccatgt 300 
tgctcacat 309 

<210> 12 
<211> 314 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 12 

tgtcaggcct ctgagcccaa gctaagccat caaatcccct gtgacctaca cgtgtacatc 60 
cagatgacct gaagcaactg aagatccaca aaagaagtga aagtagcctt aactgatgac 120 
attccaccat tgtgatttgt tcctgcccta cgctagctga taccatatat tcttcccccg 180 
cccttgagaa tgtactttgt acacctatcc caaacctata agaactaatg ataatcctac 240
caccctttgc tgactctctt tttggactca gcccgcctgc acccaggtga aataaacagc 300 
catgttgctc acat 314 



<210> 13 
<211> 341 
<212> DNA 

<213> Human endogenous retrovirus
<400> 13 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccga ctaagaatcc 60 
ctaagcctag ctgggaaggt gaccgcatcc acctttaaac acggggctcg caacttagct 120
cacacccaac caatcaggta gtaaagaggg ctcactaaaa tgctaattag gcaaagacag 180 
gaggtaaaga aatagccaat catctattgc ctgagagcac agcaggaggg acaatgatcg 240 
ggatataaac ccaagtcttc gagccggcaa tggctacctt ctttgggtcc cctccctttg 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg c 341 



<210> 14 
<211> 341 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 14 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccga ctaagaattc 60 
ctaagcctag ctgggaaggt gaccgcatcc atctttaaac atggggcttg caacttaact 120 
catatctgac caatcaggta gtaaagagag ctcactaaaa tgctaattag gctaaaacag 180 
gaggcaaaga agtagccaat catctgttgc ctgacagcac agcaggaggg acaatgatcg 240 
ggatataaac ccaggcattc gagccagcta cagctaccct ctttgggtcc cctccctttg 300 
tatgggagct ctgtcttcac tctattaaat cttgcaactg c 341 



<210> 15 
<211> 322 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 15 

tgttgagatg ggggactgag agacaggact acctggattt cctaggccga ctaagaatcc 60 
ctaagcctag ctgggaaggt gaccacatcc acctttaaac acagggcttg caacttagct 120
cacacttgac cagtcaggta gtaaagagag ctcactaaaa tgctaattag gctaaaacag 180 
gaggtaaaga aatagacaat catctatcac ctgagagcac agtgggaggg acaatgatcg 240 
gcatataaac ccaggcattc gagccagcaa cagcaacccc ctttgggagc tctgttttca 300 
ctctattaaa tcttgcaact gc 322 



<210> 16 
<211> 343 

<212> DNA
<213> Human endogenous retrovirus 

<400> 16 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60 
ctaagcctag ctgggaaggt gactacaccc acctttaaac atggggcttg caacttagct 120 
cacacccaac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240 
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 17 
<211> 343 
<212> DNA

<213> Human endogenous retrovirus 
<400> 17 

tgttgagatg ggggactgag agacaggact agctggattc cctaggccga ctaagaatcc 60 

ctaagcctag ctgggaaggt gaccacatcc acctttaaac acggggcttg caacttagct 120 

catacccaac aaatcaggta gtaaagagag ctcactaaaa tactgattag gcgaaaacag 180 

gaggtaagga aacagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240 

ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 

tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 18 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 18 

tgttgagatg ggggactgag agacaggact agttggattt cctaggctgg ctaagaatcc 60 
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180 
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240 
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300 
tatgggagcc ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 19 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 19 

tgttgagatg ggggactgag agacaggact agttggattt cctaggccgg ctaagaatcc 60 
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120 
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180 
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240 
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300 
tatggaagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 20 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 20 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60 
ctaagcctag ctgggaaggt gactacaccc acctttaacc actaggcttg caacttagct 120 
cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180 
gaggtagaga aatagccaat catctatcgc ctgagagcac agcaggaggg acaatgatcc 240 
ggatataaac ccaagcattc gagccagcaa tggctaccct ctttgtgtcc cctccctttg 300 
tatgggagct ctattttcac tctattaaat cttgcaactg caa 343 



<210> 21 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 21 

tgttgagatg ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc 60 
ctaagcctag ctgggaaggt gaccgcatcc atctttaaac atggggcttg caacttaact 120 
catatctgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 22 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 22 

tgttgagatg ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc 60
ctaagcctag ctgggaaggt gactacaccc acctttaacc actaggcttg caacttagct 120 
cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180 
gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240 
ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 23 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 23 

tgttgagatg ggggactgag agacaggact agttggattt cctaggctgg ctaagaatcc 60
ctaagcctag ctgggaaatt gaccacgtcc acctttaaac acggggcttg caatttagct 120 
cacacccgac caatcaggta gtaaagggag ctcactaaaa tgctaattag ggaaaaacag 180 
gaggtaaaga agtagccaat catctatcgc ctgagagcac aacaggaggg acaatgatca 240 
ggatataaac ccaggcattc aagccagcgg tggctaccct ctttgggtcc cctccctttg 300 
tatggaagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 24 
<211> 343 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 24 

tgttgagatg ggggactgag agacaggact acctggattt cctaggccaa ctaagaatct 60 
ctaagcctag ctgggaaggt gaccacatcc acctttaaac acagggcttg caacttagct 120 
cacacccgac ccatcaggta agaaagagag cccgctaaaa tgctaattag gcaaaaacag 180 
gaggtaaaga aatagtcaat catctattgc ctgagagcac agcgggaggg acaatgatca 240 
ggatataaac ccaggcattc gagccggcaa cgactaccct ctttgggtcc cctccctttg 300 
tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 

<210> 25 

<211> 343 

<212> DNA

<213> Human endogenous retrovirus 

<400> 25 

tgttgagatg ggggactgag agacaggact agctggattt cctaggccaa ctaagaatcc 60 

ctaagcctag ctgggaaggt gactacaccc acctttaacc actaggcttg caacttagct 120 

cacacccgac caatcaggta gtaaagagag cttgctaaaa tgctaattag gcaaaaacag 180 

gaggtaaaga aatagccagt catctatcgc ctgacagcac aaggggcggg acaatgatca 240 

ggatataaac tcaggcattc aagccagcaa tggctaccca ctttgggtcc cctcccattt 300 

tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 26 
<211> 343 



<212> DNA 

<213> Human endogenous retrovirus 
<400> 26 

tgttgagatg ggggactgag aaacaggact agcaggattt cctaggccga ttaagaatcc 60 

ctaagcctag atgggaagtt gaccacatcc acctttaaac acggggcttg caactcagct 120

cacacccgac ccatcaggta agaaagagag cccgctaaaa tgctaattag gcaaaaacag 180 

gaggtaaaga aatagccaat catctattgc ctgagagcac agcgggaggg acaatgatca 240 

ggatataaac ccaggcattc gagccggcaa cgactaccct ctttgggtcc cctccctttg 300 

tatgggagct ctgttttcac tctattaaat cttgcaactg caa 343 



<210> 27 
<211> 619 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 27 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60 

cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120 

ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180 

ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc catatttcag 240 

actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300 

gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360 

gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420 

tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480 

gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540 

atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600 

tttccccctt ttcttttcg 619 



<210> 28 
<211> 620 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 28 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60 

cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120 

ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180 

ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc catatttcag 240 

actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300 

gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360

gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420 

tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480 

gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540 

atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600 

tttccccctt ttcttttcga 620 



<210> 29 
<211> 624 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 29 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60 
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120 
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180 
ttacgggtgt cgggctgggg gatggtcagg tctttccctt cccacgaggc catatttcag 240 
actatcacat gggaagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300 
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360 
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcactgcc cttaatccat 420 
tcaaccctga gttgacacag cgcacgtttc agagagcacg gggttggggg taaggtcata 480

gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540 

atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600 

tttccccctt ttcttttcga caaa 624 



<210> 30 
<211> 646 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 30 

gcgaccggtg gatcccgggc ccgcggtacc gtcgactgca gaattcatgg agcatacaat 60
cgggttttat accgagacat tccattgccc agggacaggc aggagacaga tgccttcctc 120
ttgtctcaac tgcaagaggc attccttcct cttatactaa tcctcctcag cacagaccct 180
ttacgggtgt cgggctgggg gacggtcagg tctttccctt cccacgaggc catatttcag 240
actatcacat ggggagaaac cttggacaat acctggcttt cctaggcaga ggtccctgcg 300
gccttccgca gtttttgtgt cctgggtact tgagattagg gagtggtgat gactcttaag 360
gagcatgctg ccttcaagca tctgtttaac aaagcacatc ctgcaccgcc cttaatccat 420
tcaaccctga gttgacacag cacacgtttc agagagcacg gggttggggg taaggtcata 480
gattaacaga atctcaaggc agaagaattt ttcttaacac ataacaaaat ggagtctccc 540
atgtctactt ctttctacac agacacagta acaatctgat ctctcttgct tttccccaca 600
tttccccctt ttcttttcga caaaaccgcc atctcgagat ctgagt 646



<210> 31 
<211> 672 
<212> DNA 

<213> Human endogenous retrovirus



<400> 31 

gtcccacctc cagccctaag gcggtttttc cctatctcag tagatggagc atacaatcgg 60
gttttatacc gagacattcc attgcccagg gacaggcagg agacagatgc cttcctcttg 120
tctcaactgc aagaggcatt ccttcctctt atactaatcc tcctcagcac agacccttta 180
cgggtgtcgg gctgggggac ggtcaggtct ttcccttccc acgaggccat atttcagact 240
atcacatggg gagaaacctt ggacaatacc tggctttcct aggcagaggt ccctgcggcc 300
ttccgcagtt tttgtgtcct gggtacttga gattagggag tggtgatgac tcttaaggag 360
catgctgcct tcaagcatct gtttaacaag gcacatcctg caccgccctt aatccattca 420
accctgagtt gacacagcac acgtttcaga gagcacgggg ttgggggtaa ggtcatagat 480
taacagaatc tcaaggcaga agaatttttc ttaacacata acaaaatgga gtctcccatg 540
tctacttctt tctacacaga cacagtaaca atctgatccc tcttgctttt ccccacattt 600
cccccttttc ttatccatca cactggcggc cgctcgagca tgcatctaga gggcccaatt 660
cgccctatag tg 672



<210> 32 
<211> 593 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 32 

agtagatgga gcatacaatc gggttttata ccgagacatt ccattgccca gggacaggca 60
ggagacagat gccttcctct tgtctcaact gcaagaggca ttccttcctc ttttactaat 120
cctcctcagc acagaccctt tacaggtgtc gggctggggg acggtcaggt ctttcccttc 180
ccacgaggcc atatttcaga ctatcacatg gggagaaacc ttggacaata cctggctttc 240
ctaggcagag gtccctgcgg ccttctgcag tttttgtgtc cctgggtact tgagattagg 300
gagtggtgat gactcttaag gagcatgctg ccttcaagca tctgtttaac aaagcacatc 360
ctgcaccgcc cttaatccat tcaaccctga gttgacacag cacatgtttc agagagcacg 420
gggttggggg taaggtcata gattaacaga atctcaaggc agaagaattt ttcttagcac 480
ataacaaaat ggagtctcct atgtctactt ctttctacac agacacagta acaatttgat 540
ctctcttgct tttccccaca tttccccctt ttcttttcga caaaaccgcc atc 593



<210> 33 
<211> 943 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 33 

tgtgggcgaa ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 

ataatataga aaatagctag aataagaata gttataataa aaattagata tacacatgat 120 

catggacatt accaatcatt actacaaaca ttgttaatca ttagctttta atattactct 180 

ttgttttatt actaatataa ccaaggaata accggtagca tacggtcagg tgctgaaggg 240 

acattgtgag aagtgaccta gaaggcaaga ggtgagcctt ctgtcacgcc tgcataagga 300

cagcttgagg gctccttggt caagctgtaa caccagtgcc tgggaaggca ccgttactta 360 

gcagaccatg aaagggagtc tccattcctt ggaggagtca gggaaacact atgctccacc 420 

agcttcttgt gtatccagcc ctgcccacag tcatccagag gcataaaccc ctccctgtgg 480 

tgctgtgctt caatggccat gcttcttgtc cactttcatg ttcctcctgt actcctggtt 540 

cctctttgaa gttcgtagaa gataatggta gaagaaatag tgaaagtctt tgatctttct 600 

tataagtgca tagaagaaaa cactgatgta tgcctgcctt ccctctctgc ttcagctacc 660 

taaaaggaaa ggcccccttt cccatgatca catgacttgc ctgaccttat caatcacttg 720 

gaggactcac cctccttacc ctgtcccttt gtcttgtatg caataaatat cagcacgccc 780 

agccattcgg ggccactact ggtctccgca acttggtggt agtggtaccc tgggcccagc 840 

tgttttctct ttatctcttt tgtcttgtgt ctttatttct tacaatctct catctctgca 900 

catggggaga acaccggcaa agcccgtagg gctggacctt aca 943 

<210> 34 
<211> 389 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 34 

aaacccctcc ctgtggtgct gtgcttcaat ggccatgctt cttgtccact ttcatgttcc 60 

tcctgtactc ctggttcctc tttgaagttc gtagaagata atggtagaag aaatagtgaa 120 

agtctttgat ctttcttata agtgcataga agaaaacact gatgtatgcc tgccttccct 180 

ctctgcttca gctacctaaa aggaaaggcc ccctttccca tgatcacatg acttgcctga 240 

ccttatcaat cacttggagg actcaccctc cttaccctgt ccctttgtct tgtatgcaat 300 

aaatatcagc acgcccagcc attcggggcc actactggtc tccgcaactt ggtggtagtg 360 
gtaccctggg cccagctgtt ttctcttta 389 

<210> 35 
<211> 858 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 35 

tgtgggcgga agagtaccta ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 

ataataaaga aaatagaata agaatagtca taatacaaat tagatacagc gatgatcatg 120 

aacaattatc catcattatt ataaacatta ttaatcatta gcttttaata ttactctgtt 180 

gcattaataa tataacctag gaataaccgg caggtatagg gtcaggtgct gaagggacat 240 

tgtgagaagt gaatagaagg caagagggga gccttctgtc atgcccg^at aagggccgct 300 

tgagggcccc ttggtcaagc ggtaacgcca gtgtctggga aggcacccgt tactgagcag 360

accgggaaag ggagtctcct ttccttggag gagtcaggga acgctctgct ccaccagctt 420 

cttgtgggag gctggatgtt acccaggcct gcctgcagtc atccggaggc ctgaacccct 480 

ccctgtggtg cttcaatggt cacgttcctt gtccactttc atgctccttc cgtactcctg 540 

gttcctcttt gaagttcgta gtagatagcg gtagaagaaa tagtgaaagt cttaaagtct 600 

ttgatcttat aagttcatag aagaaaacgc tgatgcctgc cgccttctct ctctgcttca 660 

gctacctaag agggaagggc ccgctgtcct gtgatcaggt gacttgcttc accttgtcaa 720

tcacttagaa gactgaccct ccttatcctg cccccttgtc ttgtatgcaa taaatatcag 780 

cgagcccagc cgttcagggc cactaccggt ctccgtgtct ttgtggtagt ggtccccggg 840 
cccagctgtt ttctcttt 858 

<210> 36
<211> 386
<212> DNA 

<213> Human endogenous retrovirus 



<400> 36

gaacccctcc ctgtggtgct tcaatggtca cgttccttgt ccactttcat gctccttccg 60 

tactcctggt tcctctttga agttcgtagt agatagcggt agaagaaata gtgaaagtct 120 

taaagtcttt gatcttataa gttcatagaa gaaaacgctg atgcctgccg ccttctctct 180 

ctgcttcagc tacctaagag ggaagggccc gctgtcctgt gatcaggtga cttgcttcac 240 

cttgtcaatc acttagaaga ctgaccctcc ttatcctgcc cccttgtctt gtatgcaata 300 

aatatcagcg agcccagccg ttcagggcca ctaccggtct ccgtgtcttt gtggtagtgg 360 

tccccgggcc cagctgtttt ctcttt 386 



<210> 37 
<211> 844 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 37 

tgtgggtgga ggattaccca ggtgccaagg caagagactg aaggcacaaa ctgtttcagt 60 
ataataaaaa aaatagaata agaatagtca taatacaaat tagatataga gatgatcatg 120
gacaattagc aatcactatt aatctttagc ttttaatatt actctttgtt gcattactaa 180 
tataacctag gaataaccgg tgggtatagg gtcaggtgct gaagggacat tgtgtgaagt 240 
gacctggaag gcaagaggtg agccctctgt cacgcccaca taagggccgc ttgagggctc 300 
cttggtcaag tggtaacgcc agtgtctggg aatgcacccg ttaattagca gaccgcgaaa 360 
gggagtctcc tttccttgga agagttgggg aacactctgc tccaccagct tcttgtggaa 420
ggctggatat tatccaggcc tgcgcgcagt catccggagg cttaaaccct tccctgtggt 480 
gctgtgcttc aatggtccca ctccttgtcc actttcatgc tcctcccgta ctcctggttc 540 
ctctttgaag agcgcagtag atagcggtag aagaaatagt gaaagtctta aagtcttcga 600 
tctttcttac aagtgcagag aagaaaacgc tgacatatgc tgccttccct ctctgcttcg 660 
gctacctaaa agggaagggc cgcctatcct gtaatcacat gacttgcttc accttgtcaa 720 
tcacttagaa gattcactct ccttaccctg cccccttgtc ttgtatgcaa taaatatcag 780 
tgaccccagc cgttcagggc cactactggt ctccgcgtct tgatggtagt ggtcaccccg 840 
gccc 844 



<210> 38 
<211> 381 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 38 

aaacccttcc ctgtggtgct gtgcttcaat ggtcccactc cttgtccact ttcatgctcc 60 
tcccgtactc ctggttcctc tttgaagagc gcagtagata gcggtagaag aaatagtgaa 120
agtcttaaag tcttcgatct ttcttacaag tgcagagaag aaaacgctga catatgctgc 180 
cttccctctc tgcttcggct acctaaaagg gaagggccgc ctatcctgta atcacatgac 240 
ttgcttcacc ttgtcaatca cttagaagat tcactctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtga ccccagccgt tcagggccac tactggtctc cgcgtcttga 360 
tggtagtggt caccccggcc c 381



<210> 39 
<211> 859 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 39 

tgtgggtgga ggattaccca ggtgccaagg caagagactg aaggcacaaa ctgtttcagt 60 
ataataaaaa aaatagaata agaatagtca taatacaaat tagatataga gatgatcatg 120 
gacaattagc aatcactatt aatctttagc ttttaatatt actctttgtt gcattactaa 180 
tataacctag gaataaccgg tgggtatagg gtcaggtgct gaagggacat tgtgagaagt 240 
gacctggaag gcaagaggtg agccctctgt cacgcccaca taagggccgc ttgagggctc 300 
cttggtcaag tggtaacgcc agtgtctggg aatgcacccg ttaattagca gaccgcgaaa 360

gggagtctcc tttccttgga agagttgggg aacactctgc tccaccagct tcttgtggaa 420

ggctggatat tatccaggcc tgcgcgcagt catccggagg cttaaaccct tccctgtggt 480 

gctgtgcttc aatggtccca ctccttgtcc actttcatgc tcctcccgta ctcctggttc 540 

ctctttgaag agcgcagtag atagcggtag aagaaatagt gaaagtctta aagtcttcga 600 

tctttcttac aagtgcagag aagaaaacgc tgacatatgc tgccttccct ctctgcttcg 660 

gctacctaaa agggaagggc cgcctatcct gtaatcacat gacttgcttc accttgtcaa 720 

tcacttagaa gattcaccct ccttaccctg cccccttgtc ttgtatgcaa taaatatcag 780 

tgaccccagc cgttcagggc cactactggt ctccgcgtct tgatggtagt ggtcaccccg 840 

gcccaggtgt tttttcttt 859 



<210> 40 
<211> 396 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 40 

aaacccttcc ctgtggtgct gtgcttcaat ggtcccactc cttgtccact ttcatgctcc 60 
tcccgtactc ctggttcctc tttgaagagc gcagtagata gcggtagaag aaatagtgaa 120
agtcttaaag tcttcgatct ttcttacaag tgcagagaag aaaacgctga catatgctgc 180 
cttccctctc tgcttcggct acctaaaagg gaagggccgc ctatcctgta atcacatgac 240 
ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtga ccccagccgt tcagggccac tactggtctc cgcgtcttga 360
tggtagtggt caccccggcc caggtgtttt ttcttt 396 



<210> 41 
<211> 966 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 41 

tgtgggtgga ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 
ataataaaga aaatggttag aataagaata gtcataatac aaattagata tagagatgat 120
catggacaat tatcaatcat tattataaac attattaatc attagctttt aatattactc 180 
tttgttgcat tactaatata acctaggaat aaccggtggg tatagggtca ggtgctgaaa 240 
ggacattggg agaagtgacc tagaaggcaa gaggtgagtc ttctgtcacg cccgcataag 300 
ggttgcttga gggctccttg gtcaagtggt aacgccggtg tctgggaagg cacctgttac 360 
ttagccgacc acgaaaggga gtctcctttc cttggaggag tcagggcgca ctctgctcca 420
ccagcttctt gtggaaggct ggatattatc caggcctgcc cgcagtcatc cggaggccta 480 
aacccctccc tgtggtgctg tgcttcaatg ggcacactcc tcgtccactt tcatgttcct 540 
cccatactcc tggtttctct ttgaagttcg tagtagatag tggtagaagg aatagggaaa 600 
atcttaaagt gtttgatctt tcttataagt gcatagaaga aaacgctgac atatgctgcc 660 
ttctctgtct gcttcagcta cctaagaggg aagggccccc tgtccagtga tcacgtgact 720
tgcttcacct tgtcaatcac ttagaagatt caccctcctt accctgcccc cttgtcttgt 780 
atgcaataaa tatcagtgca cccagccttt cggggccact taccggtctc cacgtcttgg 840 
tggtagtggt cccccgggcc cagctgtttt ctctttatct ctttgtcttg tgtcttattt 900 
attacaatct ctcgtctccg cacacaggga gaacacccgc taagctccgt agggctggac 960 
cctaca 966



<210> 42 
<211> 398 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 42 

aaacccctcc ctgtggtgct gtgcttcaat gggcacactc ctcgtccact ttcatgttcc 60 
tcccatactc ctggtttctc tttgaagttc gtagtagata gtggtagaag gaatagggaa 120 
aatcttaaag tgtttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 
cttctctgtc tgcttcagct acctaagagg gaagggcccc ctgtccagtg atcacgtgac 240 
ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 
tatgcaataa atatcagtgc acccagcctt tcggggccac ttaccggtct ccacgtcttg 360
gtggtagtgg tcccccgggc ccagctgttt tctcttta 398 

<210> 43 
<211> 938 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 43 

tgtgggtgga ggattaccca ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 

ataataaaga aaatggttag aataagaata gtcataatac aaattagata tagagatgat 120 

catggacaat tatcaatcat tattataaac attattaatc attagctttt aatattactc 180 

tttgttgcat tactaatata acctaggaat aaccggtggg tatagggtca ggtgctgaag 240 

ggacattggg agaagtgacc tagaaggcaa gaggtgagtc ttctgtcacg cccgcataag 300 

ggttgcttga gggctccttg gtcaagtggt aacgccggtg tctgggaagg cacctgttac 360 

ttagccgacc acgaaaggga gtctcctttc cttggaggag tcagggcaca ctctgctcca 420

ccagcttctt gtggaaggct ggatattatc caggcctgcc cgcagtcatc cggaggccta 480 

aacccctccc tgtggtgctg tgcttcaatg ggcacactcc tcgtccactt tcatgttcct 540 

cccatactcc tggttcctct ttgaagttcg tagtagatag tggtagaagg aatagggaaa 600 

atcttaaagt gtttgatctt tcttataagt gcatagaaga aaacgctgac atatgctgcc 660 

ttctctgtct gcttcagcta cctaagaggg aagggccccc tgtccagtga tcacgtgact 720 

tgcttcacct tgtcaatcac ttagaagatt caccctcctt accctgcccc cttgtcttgt 780 

atgcaataaa tatcagtgca cccagccttt cggkkcactt accggtctcc acgtcttggt 840 

ggtagtggtc ccccggccca gctgttttct ctttatctct ttgtcttgtg tcttatttat 900 

tacaatctct cgtctccgca cacagggaga acacccgc 938 

<210> 44 
<211> 396 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 44 

aaacccctcc ctgtggtgct gtgcttcaat gggcacactc ctcgtccact ttcatgttcc 60 

tcccatactc ctggttcctc tttgaagttc gtagtagata gtggtagaag gaatagggaa 120 

aatcttaaag tgtttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 

cttctctgtc tgcttcagct acctaagagg gaagggcccc ctgtccagtg atcacgtgac 240 

ttgcttcacc ttgtcaatca cttagaagat tcaccctcct taccctgccc ccttgtcttg 300 

tatgcaataa atatcagtgc acccagcctt tcggkkcact taccggtctc cacgtcttgg 360 

tggtagtggt cccccggccc agctgttttc tcttta 396 

<210> 45 
<211> 963 
<212> DNA 

<213> Human endogenous retrovirus 
<400> 45 

tgtgggcgaa agattaccta ggtgccgagg caagagactg aaggcacaaa ctgtttcagt 60 
ataataaaga aaatagttaa aataagaata gttataatac aaattagata tagagatgat 120 
catggacaat tatcaatcat tattataaac attaatcatt agcttttaat attactcttt 180 
gttgctttac taatataacc taggaataac cggtgggtat agggtcaggt gttgacggga 240 
tattgtgaga agtgacctag aaggcaagag gtgagccttc tgtcacgccc acataagggc 300
cgcttgaggg ctctttggtc aagtggtaac gccagtgtct gtgaaggcac ctgttactta 360 
gcagaccgcg aaagggagtc tcctttcctt ggaggagtca gggaacactc tgctccacca 420 
gcttcttgtg gaaggctgga tattatctag gcctgcccgc agtcatctgg aggcctaaac 480 
ccctccctgt ggtgctgtgc ttcagtggtc actctccttg tccactttca tgttcctccc 540 
gtactcctgg ttcctctttg aagttcgtag tagatagcag tagaagaaat agtgaaagtc 600 
ttaaagtatt tgatctttct tataagtgca tagaagaaaa cgctgacata tgctgccttc 660 
tctatctctg cggtggctac ctaaaaggga agggccccct gtcccatgat catgtgactt 720 
gcttcacctt atcacttaga agattcatcc tccttaccct gcgccccctc gtcttgtatg 780 
caataaatat cagcacgccc agtcgtttga ggccactgcc ggtctccgcg tcttggtggt 840 
agtggtcccc cgggcccagc tattgtctct ttatctcttt gtcttgtgtc tttatttatt 900
acaatctctt gtctctgcac acagggagaa cacctgctaa gccccgtagg actggaccct 960
aca 963 



<210> 46 
<211> 397 
<212> DNA 

<213> Human endogenous retrovirus 



<400> 46 

aaacccctcc ctgtggtgct gtgcttcagt ggtcactctc cttgtccact ttcatgttcc 60 

tcccgtactc ctggttcctc tttgaagttc gtagtagata gcagtagaag aaatagtgaa 120 

agtcttaaag tatttgatct ttcttataag tgcatagaag aaaacgctga catatgctgc 180 

cttctctatc tctgcggtgg ctacctaaaa gggaagggcc ccctgtccca tgatcatgtg 240 

acttgcttca ccttatcact tagaagattc atcctcctta ccctgcgccc cctcgtcttg 300 

tatgcaataa atatcagcac gcccagtcgt ttgaggccac tgccggtctc cgcgtcttgg 360

tggtagtggt cccccgggcc cagctattgt ctcttta 397 



<210> 47 
<211> 489 
<212> DNA 

<213> Human endogenous retrovirus



<400> 47 

tgttcaattc tttgccttct acttttaaac ttaacttcct cataaagcaa cctttttcaa 60
tcacctgctc cactctgact cattctgatc acctgctcca ccctgactca ttccgatcac 120
ctgatccact gtgactcatt ccgattaccc gctccaccct gactcattct gattctgatt 180
tcctgctctg ccataaccat ttttcccgcc aaaccactca ccctgtcact ctctttaaat 240
tagccaattg gaattagttt agcctgtgcg gtctaaccct agccaatagg ggactgacac 300
agcagcaggg gccacatgtg tcaggaataa gacccccttc ccctccctgt ccagatgtgt 360
gctcaccatt gctccatctg tgagggcaca cccttctata gaagtaaatt gccttgctga 420
gaagaaaaaa aagaacattt tatattcaag tcctatttct tttgctgcac cgaaacttta 480
tttataaca 489